Hog cholera virus vaccine and diagnostic

ABSTRACT

The present invention is concerned with a hog cholera virus vaccine comprising a polypeptide characteristic of hog cholera virus. Vector vaccines capable to express a nucleic acid sequence encoding such a polypeptide also form part of the present invention. Said polypeptide and nucleic acid sequence can also be used for the detection of hog cholera virus infection.

This is a continuation of application U.S. Ser. No. 08/747,577, filed Nov. 7, 1996, now abandoned, which is a continuation of U.S. Ser. No. 08/650,584, filed May 20, 1996, now abandoned, which is a continuation of Ser. No. 08/469,702, filed Jun. 6, 1995, now abandoned, which is a continuation of U.S. Ser. No. 08/123,596, filed Sep. 20, 1993, now abandoned, which is a continuation of U.S. Ser. No. 07/797,554, filed Nov. 22, 1991, now abandoned, which is a continuation-in-part of U.S. Ser. No. 07/494,991, filed Mar. 16, 1990, now abandoned.

The present invention is concerned with a nucleic acid sequence, a recombinant nucleic acid molecule comprising such a nucleic acid sequence, a recombinant expression system comprising such a recombinant nucleic acid molecule, a polypeptide characteristic of the hog cholera virus, a vaccine comprising such a polypeptide or recombinant expression system as well as a method for the preparation of such vaccines.

Classical swine fever or hog cholera (HC) represents an economically important disease of swine in many countries worldwide. Under natural conditions, the pig is the only animal known to be susceptible to HC. Hog cholera is a highly contagious disease which causes degeneration in the walls of capillaries, resulting in hemorrhages and necrosis of the internal organs. In the first instance hog cholera is characterized by fever, anorexia, vomiting and diarrhea which can be followed by a chronic course of the disease characterized by infertility, abortion and weak offsprings of sows. However, nearly all pigs die within 2 weeks after the first symptoms appear.

The causative agent, the hog cholera virus (HCV) has been shown to be structurally and serologically related to bovine viral diarrhea virus (BVDV) of cattle and to border disease virus (BDV) of sheep. These viruses are grouped together into the genus pestivirus within the family togaviridae. The nature of the genetic material of pestiviruses has long been known to be RNA, i.e. positive-strand RNA which lacks significant polyadenylation. The HCV probably comprises 3-5 structural proteins of which two are possibly glycosylated. The number of non-structural viral proteins is unknown.

Modified HCV vaccines (comprising attenuated or killed viruses) for combating HC infection have been developed and are presently used. However, infection of tissue culture cells to obtain HCV material to be used in said modified virus vaccines, leads to low virus yields and the virions are hard to purify. Modified live virus vaccines always involve the risk of inoculating animals with partially attenuated pathogenic HCV which is still pathogenic and can cause disease in the inoculated animal or offspring and of contamination by other viruses in the vaccine. In addition the attenuated virus may revert to a virulent state.

There are also several disadvantages using inactivated vaccines, e.g. the risk of only partial inactivation of viruses, the problem that only a low level of immunity is achieved requiring additional immunizations and the problem that antigenic determinants are altered by the inactivation treatment leaving the inactivated virus less immunogenic.

Furthermore, the usage of modified HCV vaccines is not suited for eradication programs.

Until now, according to our knowledge diagnostic tests in swine which can distinguish between HCV or BVDV infection are not available. This is important as BVDV infection in pigs is of lower significance than HCV infection which means that BVDV infected pigs do not have to be eradicated.

Vaccines containing only the necessary and relevant HCV immunogenic material which is capable of eliciting an immune response against the pathogen do not display abovementioned disadvantages of modified vaccines.

According to the present invention a nucleic acid sequence encoding a polypeptide characteristic of hog cholera virus has been found. Fragments of said nucleic acid sequence or said polypeptide are also within the present invention. Both the nucleic acid sequence and the polypeptide or fragments thereof can be used for the preparation of a vaccine containing only the necessary and relevant immunogenic material for immunizing animals against HCV infection. "Nucleic acid sequence" refers both to a ribonucleic acid sequence and a deoxy-ribonucleic acid sequence.

A nucleic acid sequence according to the present invention is shown in SEQ ID NO:1!. As is well known in the art, the degeneracy of the genetic code permits substitution of bases in a codon resulting in an other codon but still coding for the same amino acid, e.g. the codon for the amino acid glutamic acid is both GAT and GAA. Consequently, it is clear that for the expression of a polypeptide with the amino acid sequence shown in SEQ ID NOS:1 and 2! use can be made of a nucleic acid sequence with such an alternative codon composition different from the nucleic acid sequence shown in SEQ ID NO:1!.

Also included within the scope of the invention are nucleic acid sequences which hybridize under stringent conditions to the nucleic acid sequence shown in SEQ ID NO:1!. These nucleic acid sequences are related, to the nucleic acid sequence shown in SEQ ID NO:1! but may comprise nucleotide substitutions, mutations, insertions, deletions etc. and encode polypeptides which are functionally equivalent to the polypeptide shown in SEQ ID NOS:1 and 2!, i.e. the amino acid sequence of a related polypeptide is not identical with the amino acid sequence shown in SEQ ID NOS:1 and 2! but features corresponding immunological properties characteristic for HCV.

Within the scope of the invention are also polypeptides encoded by such related nucleic acid sequences.

The nucleic acid sequence shown in SEQ ID NO:1! is a cDNA sequence derived from the genomic RNA of HCV. This continuous sequence is 12284 nucleotides in length, and contains one long open reading frame (ORF), starting with the ATG codon at position 364 to 366 and ending with a TGA codon as a translational stop codon at position 12058 to 12060. This ORF consists of 3898 codons capable of encoding 435 kDa of protein.

In vivo, during HCV replication in an infected cell, this protein is synthesized as a polyprotein precursor molecule which is subsequently processed to fragment polypeptides by (enzymatic) cleavage of the precursor molecule. These fragments form after possible post-translational modifications the structural and non-structural proteins of the virus. A preferred nucleic acid sequence contains the genetic information for such a fragment with immunizing properties against HCV or immunological properties characteristic for HCV or contains the genetic information for a portion of such a fragment which still has the immunizing properties or the immunological properties characteristic for HCV.

The term "fragment or portion" as used herein means a DNA or amino acid sequence comprising a subsequence of one of the nucleic acid sequences or polypeptides of the invention. Said fragment or portion is or encodes a polypeptide having one or more immunoreactive and/or antigenic determinants of a HCV polypeptide, i.e. has one or more epitopes which are capable of eliciting an immune response in pigs and/or is capable of specifically binding to a complementary antibody. Such epitope containing sequences are at least 5-8 residues long (Geysen, H. M. et al., 1987). Methods for determining usable polypeptide fragments are outlined below. Fragments or portions can inter alia be produced by enzymatic cleavage of precursor molecules, using restriction endonucleases for the DNA and proteases for the polypeptides. Other methods include chemical synthesis of the fragments or the expression of polypeptide fragments by DNA fragments.

Fragment polypeptides of the polypeptide according to SEQ ID NOS:1 and 2! and the portions thereof, which can be used for the immunization of animals against HC or for diagnosis of HC also form part of the present invention. A fragment-coding region is located within the amino acid position about 1-249, 263-487, 488-688 or 689-1067. The 1-249 region essentially represents the core protein whereas the 263-487, 488-688 and 689-1067 regions essentially represent glycoproteins of 44/48 kD, 33 kD and 55 kD respectively. Within the scope of the invention are also nucleic acid sequences comprising the genetic information for one or more of the coding regions mentioned above or portions thereof.

A preferred region to be incorporated into a vaccine against HCV infection is the region corresponding to the 55 kD protein of HCV or a portion thereof still having immunizing activity.

Furthermore, a nucleic acid sequence at least comprising the coding sequences for said 55 kD protein or portion thereof can advantageously be applied according to the present invention.

In addition, a preferred portion of the HCV 55 kD protein, which can be used for immunization of pigs against HCV infection, is determined by analyses of HCV deletion mutants with anti-55 kD protein monoclonal antibodies having virus neutralizing activity. Such a portion comprising an epitope spans the amino acid sequence about 812-859 and is coded by the nucleotide sequence about 2799-2938. A polypeptide at least comprising said amino acid sequence or a nucleic acid sequence at least comprising said nucleotide sequence form part of the present invention too.

A nucleic acid sequence according to the invention which can be used for the diagnosis of HCV infection in pigs and which can be applied to discriminate HCV from BVDV can be derived from the gene encoding the 55 kD protein.

Preferably, such a nucleic acid sequence is derived from the nucleotide sequences 2587-2619 or 2842-2880, both sequences being part of the gene encoding the 55 kD protein. A preferred oligonucleotide for diagnostic purposes is (SEQ ID NO: 3 and 4, respectively):

5'-CCT ACT AAC CAC GTT AAG TGC TGT GAC TTT AAA-3' or

5'-TTC TGT TCT CAA GGT TGT GGG GCT CAC TGC TGT GCA CTC-3'

Moreover, a nucleic acid sequence comprising at least a sub-sequence of said oligonucleotides and which still can be used to differentiate between HCV and BVDV forms part of the invention.

The invention also relates to a test kit to be used in an assay, this test kit containing a nucleic acid sequence according to the invention.

Preferably the test kit comprises an oligonucleotide mentioned above or a nucleic acid sequence comprising at least a sub-sequence thereof.

Variations or modifications in the polypeptide shown in SEQ. ID NOS:1 and 2! or fragments thereof, such as natural variations between different strains or other derivatives, are possible while retaining the same immunologic properties. These variations may be demonstrated by (an) amino acid differences) in the overall sequence or by deletions, substitutions, insertions, inversions or additions of (an) amino acid(s) in said polypeptide.

Moreover, the potential exists, in the use of recombinant DNA technology, for the preparation of various derivatives of the polypeptide shown in SEQ ID NOS:1 and 2! or fragments thereof, variously modified by resultant single or multiple amino acid substitutions, deletions, additions or replacements, for example by means of site directed mutagenesis of the underlying DNA. All such modifications resulting in derivatives of the polypeptide shown in SEQ ID NOS:1 and 2! or fragments thereof are included within the scope of the present invention so long as the essential characteristic activity of said polypeptide or fragment thereof, remains unaffected in essence.

RNA isolated from pelleted virions was isolated and used for the synthesis of cDNA. This cDNA was cloned in phage λgt11 and the respective library was amplified and screened with goat anti-HCV antiserum. Two positive clones could be identified and shown to have inserts with sizes of 0.8 kb and 1.8 kb. The 0.8 kb λgt11 insert was partially sequenced (see SEQ ID NOS:12 and 13!) and determined to be located between about 1.2 and 2.0 kb on the HCV genome (see SEQ ID NO:1!).

A nucleic acid sequence according to the invention which can be used for the diagnosis of HCV in infected animals and which surprisingly can be applied to discriminate HCV from BVDV is represented by the nucleotide sequence 5551-5793 shown in SEQ ID NO:1!.

Moreover, a nucleic acid sequence comprising at least a sub-sequence of said nucleotide sequence and which still can be used to differentiate between HCV and BVDV forms part of the invention.

The invention also relates to a test kit to be used in an assay, this test kit containing a nucleic acid sequence according to the invention.

Preferably the test kit comprises the nucleic acid sequence represented by the nucleotide sequence 5551-5793 shown in SEQ ID NO:1! or a nucleic acid sequence comprising at least a sub-sequence thereof mentioned above.

RNA isolated from pelleted virions was isolated and used for the synthesis of cDNA. This cDNA was cloned in phage λgt11 and the respective library was amplified and screened with goat anti-HCV antiserum.

Two positive clones could be identified and shown to have inserts with sizes of 0.8 kb and 1.8 kb. The 0.8 kb λgt11 insert was partially sequenced (see SEQ ID NOS:12 and 13!), and determined to be located between about 1.2 and 2.0 kb on the HCV genome (see SEQ ID NO:1!).

A nucleic acid sequence according to the present invention can be ligated to various vector nucleic acid molecules such as plasmid DNA, bacteriophage DNA or viral DNA to form a recombinant nucleic acid molecule. The vector nucleic acid molecules preferably contain DNA sequences to initiate, control and terminate transcription and translation. A recombinant expression system comprising a host containing such a recombinant nucleic acid molecule can be used to allow for a nucleic acid sequence according to the present invention to express a polypeptide encoded by said nucleic acid sequence. The host of above-mentioned recombinant expression system can be of procaryotic origin, e.g. bacteria such as E.coli, B.subtilis and Pseudomonas, viruses such as vaccinia and fowl pox virus or eucaryotic origin such as yeasts or higher eucaryotic cells such as insect, plant or animal cells.

Immunization of animals against HC can, for example, be achieved by administering to the animal a polypeptide according to the invention as a so-called subunit vaccine. The subunit vaccine according to the invention comprises a polypeptide generally in a pure form, optionally in the presence of a pharmaceutically acceptable carrier.

Small fragments are preferably conjugated to carrier molecules in order to raise their immunogenicity. Suitable carriers for this purpose are macromolecules, such as natural polymers (proteins, like key hole limpet hemocyanin, albumin, toxins), synthetic polymers like polyamino acids (polylysine, polyalanine), or micelles of amphiphilic compounds like saponins. Alternatively these fragments may be provided as polymers thereof, preferably linear polymers. Polypeptides to be used in such subunit vaccines can be prepared by methods known in the art, e.g. by isolating said polypeptides from hog cholera virus, by recombinant DNA techniques or by chemical synthesis.

If required the polypeptides according to the invention to be used in a vaccine can be modified in vitro or in vivo, for example by glycosylation, amidation, carboxylation or phosphorylation.

An alternative to subunit vaccines are "vector" vaccines. A nucleic acid sequence according to the invention is integrated by recombinant techniques into the genetic material of another microorganism (e.g. virus or bacterium) thereby enabling the microorganism to express a polypeptide according to the invention. This recombinant expression system is administered to the animal to be immunized whereafter it replicates in the inoculated animal and expresses the polypeptide resulting in the stimulation of the immune system of the animal. Suitable examples of vaccine vectors are pox viruses (such as vaccinia, cow pox, rabbit pox), avian pox viruses (such as fowl pox virus) pseudorabies virus, adeno viruses, influenza viruses, bacteriophages or bacteria (such as Escherichia coli and Salmonella).

The recombinant expression system having a nucleic acid sequence according to the invention inserted in its nucleic acid sequence can for example be grown in a cell culture and can if desired be harvested from the infected cells and formed to a vaccine optionally in a lyophilized form. Said genetically manipulated microorganism can also be harvested from live animals infected with said microorganism. Abovementioned recombinant expression system can also be propagated in a cell culture expressing a polypeptide according to the invention, whereafter the polypeptide is isolated from the culture.

A vaccine comprising a polypeptide or a recombinant expression system according to the present invention can be prepared by procedures well-known in the art for such vaccines. A vaccine according to the invention can consist inter alia of whole host, host extract, partially or completely purified polypeptide or a partially or completely purified recombinant expression system as above-mentioned.

The vaccine according to the invention can be administered in a conventional active immunization scheme: single or repeated administration in a manner compatible with the dosage formulation and in such amount as will be therapeutically effective and immunogenic. The administration of the vaccine can be done, e.g. intradermally, subcutaneously, intramusculary, intravenously or intranasally. For parenteral administration the vaccines may additionally contain a suitable carrier, e.g. water, saline or buffer solution with or without adjuvants, stabilizers, solubilizers, emulsifiers etc.

The vaccine may additionally contain immunogens related to other diseases or nucleic acid sequences encoding these immunogens like antigens of parvovirus, pseudorabies virus, swine influenza virus, TGE virus, rotavirus, Escherichia coli, Bordetella, Pasteurella, Erysipelas etc. to produce a multivalent vaccine.

Polypeptides according to the present invention can also be used in diagnostic methods to detect the presence of HCV antigen or antibody in an animal. Moreover, nucleic acid sequences according to the invention can be used to produce polypeptides to be used in above-mentioned diagnostic methods or as a hybridization probe for the detection of the presence of HCV nucleic acid in a sample.

EXAMPLE 1 Immunological identification of cDNA clones

Infection of cells and harvesting of virus. PK15 and 38A₁ D cells were grown in DMEM with 10% FCS and were infected in suspension by the virulent HCV strain Alfort in a volume of 20-30 ml at a cell concentration of 5×10⁷ /ml at 37° C. for 90 min with an m.o.i. of 0.01 to 0.001 (as determined by immunofluorescence assay). Thereafter, the PK15 cells were seeded in tissue culture plates (150 mm diameter), while the suspension cells 38A₁ D were incubated in bottles with gentle stirring (Tecnomara, Switzerland). For cDNA synthesis, the tissue culture supernatant was harvested 48 hours after infection, clarified at 12,000 g, and afterwards the virus pelleted in a TFA 20 rotor (Contron, Italy) at 54,000 g for 12 hours.

Preparation of goat anti-HCV serum. A fibroblastic cell strain was established from the skin biopsy of a young goat by standard cell culture techniques. The cells were initially grown in F-10 medium with 10% FCS and later in DMEM with 10% FCS. Goat fibroblasts were infected with HCV. Over the first 26 hours p.i., the cells were washed every 8 hours 3 times with PBS and afterwards incubated in DMEM with 10% preimmune goat serum (PGS). 48 hours p.i., the tissue culture supernatant was harvested and used as stock virus. Before immunization, goat cells for 30 tissue culture dishes (150 mm diameter) were kept for 3 passages in medium with 10% PGS and then infected with the stock virus. 48 hours p.i., the goat was immunized with X-ray-inactivated pelleted virus and infected cells. Both were emulsified in Freund's adjuvant (complete for basis immunization, incomplete for booster injections) and injected subcutaneously. To obtain antibodies recognizing denatured molecules, the antigen preparations were incubated in 0.2% SDS, 3 mM DTT at 95° C. for 5 min before injection.

RNA preparation, cDNA synthesis and cloning. RNA from virions was isolated by using the guanidine thiocyanate method described by Chirgwin et al. (1979). RNA from pelleted virions (5 μg total RNA, approximately 0.5 μg HCV RNA) and 0.1 μg of random hexanucleotide primer (Pharmacia, Sweden) in 20 μl of water were heated to 65° C. for 10 min, chilled on ice, and adjusted to first strand buffer (50 mM Tris-HCl pH 8.3; 30 mM KCl; 8 mM MgCl₂ ; 1 mM DTT, DATP, dCTP, dGTP, dTTP 1 mM each and 500 units RNAguard (Ribonuclease inhibitor from human placental extract) Pharmacia, Sweden! per ml) in a final volume of 32 μl. 35 units of AMV reverse transcriptase (Life Sciences Inc., U.S.A.) were added. After 1 hour at 43° C. the reaction mixture was added to one vial of second strand synthesis mixture (cDNA synthesis kit, Pharmacia, Sweden). Second strand synthesis, preparation of blunt ends, and Eco RI adaptor ligation and phosphorylation were done as recommended by the supplier.

The cDNA was size-fractionated by preparative agarose gel electrophoresis. The part of the gel containing DNA molecules smaller than 0.5 kb was discarded. The remaining DNA was concentrated by running the gel reversely for 15 min and extracted from the agarose after 3 cycles of freezing and thawing with phenol.

Ethanol co-precipitated cDNA and λgt11 DNA (1 μg EcoRI digested dephosphorylated arms, Promega, U.S.A.) was ligated by 3 units of T4 DNA ligase (Pharmacia, Sweden) in a total volume of 10 μl ligase buffer (30 mM Tris-HCl pH 7.4; 10 mM MgCl₂ ; 10 mM DTT; 1 mM ATP). In vitro packaging with a commercially available extract (Packagene, Promega, U.S.A.) and infection of E.coli K12 cells, strain Y 1090, with resulting phages was performed as recommended by the supplier. The library was amplified once as described (Davis et al., 1986).

Screening of λgt11 library. Screening was basically performed as described (Young and Davis, 1983) using the Protoblot system purchased from Promega, U.S.A. (Huynh et al., 1985) and a serum dilution of 10⁻³. For background reduction the goat anti HCV serum was treated with E.coli lysate (strain Y1090) at 0.8 mg/ml (Huynh et al., 1985). Two positive clones having inserts of 0.8 kb and 1.8 kb, respectively could be identified.

Nick translation and Northern hybridization. 50 ng of the 0.8 kb HCV nucleic acid sequence labeled with α³² P!dCTP (3000 Ci per nMole, Amersham Buchler, FRG) by nick translation (nick translation kit, Amersham Buchler, FRG) was hybridized to Northern filters at a concentration of 5 ng per ml of hybridization mixture (5×SSC; 1×Denhardt's; 20 mM sodium phosphate pH 6.8; 0.1% SDS and 100 μg yeast tRNA Boehringer-Mannheim, FRG! per ml) at 68° C. for 12 to 14 hours. Membranes were then washed as described (Keil et al., 1984) and exposed at -70° C. to Kodak X-Omat AR films for varying times using Agfa Curix MR 800 intensifying screens.

The 0.8 kb nucleic acid sequence hybridized not only to intact HCV RNA but also to degradation products thereof. The 0.8 kb nucleic acid sequence did not hybridize to the 1.8 kb nucleic acid sequence, indicating that these two nucleic acid sequences correspond with fragments of the HCV genome which are not located in the same region of the genomic RNA.

Nucleotide sequencin. Subcloning of HCV specific phage DNA inserts into plasmid pEMBL 18 plus was done according to standard procedures (Maniatis et al., 1982). Single-stranded DNA of recombinant pEMBL plasmids was prepared as described (Dente et al., 1985). Dideoxy sequencing reactions (Sanger et al., 1977) were carried out as recommended by the supplier (Pharmacia, Sweden).

EXAMPLE 2 Molecular cloning and nucleotide sequence of the genome of HCV

RNA preparation, cDNA synthesis and cloning. RNA preparation, cDNA synthesis, size selection and ligation of co-precipitated cDNA and λgt10 DNA (1 μg EcoRI digested dephosphorylated arms, Promega, U.S.A.) were done as described above. In vitro packaging of phage DNA using Packagene (Promega, U.S.A.) and titration of phages on E.coli strain C 600 HFL were performed as suggested by the supplier. The library was amplified once (Davis et al., 1986), and replicas transferred to nictrocellulose membranes (Amersham Buchler, FRG) (Benton and Davis, 1977) were hybridized with oligonucleotides as described above for Northern hybridization. Screening with cDNA fragments labeled with α³² P! dCTP by nick translation (nick translation kit, Amersham Buchler, FRG) was done as described by Benton and Davis (1977). Positive clones were plaque purified and inserts subcloned into pEMBL plasmids (Maniatis et al., 1982; Dente et al., 1985; Davis et al., 1986).

A ³² P 5'-end labeled oligonucleotide of 17 bases complementary to the RNA sequence encoding the amino acid sequence Cys Gly Asp Asp Gly Phe was used for screening a λgt10 cDNA library. This oligonucleotide which hybridized to the about 12 kb genomic RNA of HCV, identified inter alia a clone with an insert of 0.75 kb, which hybridized also to HCV RNA. This 0.75 kb nucleic acid sequence which represents a fragment of the HCV genome together with the 0.8 kb λgt11 nucleic acid sequence insert were used for further library screening resulting in a set of overlapping HCV nucleic acid sequences of which the relative positions and restriction site maps are shown in FIG. 1. These nucleic acid sequence fragments of the HCV genome are located between the following nucleic acid positions

4.0 kb fragment: 27-4027

4.5 kb fragment: 54-4494

0.8 kb fragment: 1140-2002

4.2 kb fragment: 3246-7252

5.5 kb fragment: 6656-11819

and within about the following nucleic acid positions

3.0 kb fragment: 8920-11920

1.9 kb fragment: 10384-12284

0.75 kb fragment: 10913-11663

Nucleotide sequencing. For complete nucleotide sequence determination exonuclease III and nuclease S1 (enzymes from Boehringer Mannheim, FRG) were used to establish deletion libraries of HCV derived cDNA inserts subcloned into pEMBL 18+ or 19+ plasmids (Hennikoff, 1987). Dideoxy sequencing (Sanger et al. 1977) of single stranded (Dente et al., 1985) or double stranded DNA templates was carried out using the T7 polymerase sequencing kit (Pharmacia, Sweden).

From the cDNA fragments a continuous sequence of 12284 nucleotides in length could be determined as shown in SEQ ID NO:1!. This sequence contains one long open reading frame (ORF), starting with the ATG codon at position 364 to 366 and ending with TGA as a translational stop codon at 12058 to 12060. This ORF consists of 3898 codons capable of encoding a 435 kDa protein with an amino acid sequence shown in SEQ ID NOS:1 and 2!. Three nucleotide exchanges were detected as a result of differences in nucleotide sequence caused by possible heterogenicity of the virus population, two of which resulted in changes in the deduced amino acid sequence SEQ ID NOS:1 and 2!.

It is concluded that almost the complete HCV genome has been cloned and sequenced by the procedures described above.

The 0.8 kb λgt11 nucleic acid sequence encoding an immunogenic HCV polypeptide identified with anti HCV serum was partially sequenced (see SEQ ID NOS:12 and 13!) which revealed that this sequence is located within 1.2 and 2.0 kb on the HCV RNA.

EXAMPLE 3 Molecular cloning and expression of fusion proteins of HCV

cDNA fragments derived from two regions of the HCV genome, i.e. the 0.8 kb λgt11 insert of example 1 encoding amino acids 262-546 (see SEQ ID NOS:1 and 2!) and the nucleic acid sequence encoding amino acids 747-1071 ( SEQ ID NOS:1 and 2!), are expressed as fusion proteins in the pEx system (Strebel, K. et al., 1986).

Bacterial extracts were separated by SDS-PAGE and stained according to standard procedures, and then tested for reactivity with the goat anti-HCV serum of example 1 in a Western blot.

The HCV specific fusion proteins were partially purified by SDS-PAGE and transfered to nitrocellulose and incubated with the goat anti-HCV serum. Specific antibodies against said fusion proteins were obtained after elution.

Antibodies specific for the above-mentioned fusion proteins were employed in a radio-immuno precipitation assay.

RESULTS

Both fusion proteins expressed in the pEx system were clearly identified as HCV specific after reaction with the goat anti-HCV serum.

Monospecific antiserum prepared against both fusions proteins precipitated HCV glycoproteins.

Antibodies specific for the 262-546-fusion protein precipitated the 44/48 kD and 33 kD protein, antibodies specific for the 747-1071-fusion protein precipitated the 55 kD protein from virus infected cells.

EXAMPLE 4 Molecular cloning and expression of structural proteins via vaccinia virus

A fragment of the 4.0 kb clone shown in FIG. 1 (pHCK11) is prepared starting at the HinfI restriction site (nucleotide 372) and ending at an artificial EcoRI site (nucleotide 4000) (Maniatis et al. 1982). For the 5' end an oligonucleotide adaptor was synthesized which contained an overhang compatible to BamHI, the original ATG(364-366) as translational start codon and a protruding end compatible to HinfI at the 3' end (SEQ ID NO: 5 and 6).

    ______________________________________         5' GATCCACCATGGAGTT     HinfI     BamHI      GTGGTACCTCAACTTA 5'     ______________________________________

At the 3' end of the construct a translational stop codon was introduced by deletion of the EcoRI protruding end with Mung bean nuclease and ligation into a blunt-end Stul/EcoRI adaptor residue (SEQ ID NO: 7):

    ______________________________________             5' GCCTGAATTC 3'EcoRI                CGGACTTAAG     ______________________________________

(Maniatis et al. 1982).

Prior to inserting above-mentioned HCV sequences into vaccinia virus the heterologous gene is cloned into a recombination vector. For this purpose a pGS62 plasmid (Cranage, M. P. et al. 1986) was used which contains a cloning site downstream the P7.5K promotor within the 4.9 kb thymidine kinase sequence. The cloning site comprises three unique restriction sites, BamHI, SmaI and EcoRI. The recombination vector pGS62-3.8 was established by ligation of the described HCV sequence (372-4000) together with the adaptors into the BamHI/EcoRI digested pGS62.

Based on the plasmid a set of 15 deletion mutants was established. By treatment with ExonucleaseIII (Hennikof et al., 1987) subsequent shortening of the HCV cDNA from the 3' end was performed. All deletions are located within the region coding for the HCV 55 kD protein by removal of about 100 bp; most of the 55 kD protein is lost in mutant 15 ending at nucleotide 2589. ExoIII shortened cDNA clones were ligated into the pGS62 giving rise to pGS62-3.8Exo 1-15 (FIG. 2).

CVI cells were infected with vaccinia (strain Copenhagen, mutant TS7) at a MOI of 0.1. Three hours after infection pGS62-3.8 DNA as well as vaccinia WR DNA were transfected by the Ca₃ (PO₄)₂ precipitation method and incubated for two days. Virus progeny was harvested and selected for tk-phenotype on 143 tk-cells in the presence of brom-deoxy-Uridine (100 μg/ml). This selection was performed at least twice followed by two further cycles of plaque purification.

Characterization of vaccinia-HCV recombinants

CVI cells were infected at an MOI between 2 and 10 with vaccinia-HCV recombinants and incubated for 8-16 hours. After fixation of the cells indirect immunofluorescence was performed using either monoclonal antibodies specific for HCV 55 kD protein or polyvalent anti-HCV sera. In all cases a cytoplasmatic fluorescence could be demonstrated.

After radioimmunoprecipitation and western blot analysis of cells infected with vaccinia recombinants four HCV-specific proteins were detected. By labeling with ³ H! glucosamine it was shown that three of these proteins are glycosylated. The apparent molecular weights of these proteins were identical to those found in HCV infected cells with HCV specific sera, namely 20 kD(core), 44/48 kD, 33 kD and 55 kD.

Proteolytic processing and modifications appear to be authentic since HCV proteins produced by expression via vaccinia virus have the same apparent molecular weights as in HCV infected cells.

Induction of neutralizing antibodies against HCV in mice

Four groups of mice (3 mice/group) were infected once with

    ______________________________________     a.  Vaccinia WR wildtype                         (5 × 10.sup.6 pfu/individual)                                       WR     b.  Vaccinia 3.8 recombinant                         (5 × 10.sup.7 pfu/individual)                                       VAC3.8     c.  Vaccinia 3.8Exo 4 (55 kD                         (5 × 10.sup.7 pfu/individual)                                       VAC3.8Exo 4         deleted)     d.  Vaccinia 3.8Exo 5                         (5 × 10.sup.7 pfu/individual)                                       VAC3.8Exo 5     e.  Vaccinia 3.8Exo 15 (55 kD                         (5 × 10.sup.7 pfu/individual)                                       VAC3.8Exo 15         deleted)     ______________________________________

by injection of purified virus intraperitoneally.

Mice were bled three weeks later. The reactivity of the sera was checked in a virus neutralization assay with HCV (Alfort) on PK 15! cells after serial dilution. (Rumenapf, T. et al. 1989).

    ______________________________________     Neutralization titers     ______________________________________     a.        WR         <1:2     b.        VAC3.8        1:96     c.        VAC3.8Exo 4                             1:96     d.        VAC3.8Exo 5                          <1:2     e.        VAC3.8Exo 15                          <1.2     ______________________________________

From the above it can be concluded that vaccinia virus containing a nucleic acid sequence comprising the genetic information for all structural proteins (VAC3.8) is able to induce virus neutralizing antibodies in mice, while incomplete constructs VAC3.8Exo 5-15 and WR are not.

As all deletions are located within the region coding for HCV 55 kD protein (most of the 55 kD protein is lost in mutant 15 ending at nucleotide 2589) and the other structural proteins are still being expressed by the recombinant vaccinia virus, it is clear that the 55 kD protein is responsible for the induction of HCV neutralizing antibodies.

EXAMPLE 5 Immunization of pigs with VAC3.8

Out of three piglets (about 20 kg in weight) one animal (no. 28) was infected with wild type vaccinia virus (WR strain) and the other two (no. 26, 27) with recombinant VAC3.8 (i.p., i.v. and i.d., respectively). For infection 1×10⁸ pfu of vaccinia virus is applied to each animal.

Clinical signs in the course of vaccinia infection were apparent as erythema at the side of scarification and fever (41° C.) at day six after infection.

Titers against vaccinia and hog cholera virus:

Three weeks after infection the reactivity of the respective sera against vaccinia (WR on CVI cells) and HCV (Alfort on PK15 cells) was checked.

Neutralization was assayed after serial dilution of the sera by checking for complete absence of cpe (vaccinia) or specific signals in immunofluorescence (HCV). (Rumenapf, T. et al. 1989).

    ______________________________________     Neutralization titers against vaccinia:            pig 28 (WR)       1:8            pig 26 (VAC3.8)   1:16            pig 27 (VAC3.8)   1:16     Neutralization titers against HCV:            pig 28 (WR)     <1:2            pig 26 (VAC3.8)   1:32            pig 27 (VAC3.8)   1:16     ______________________________________

Challenge with HCV:

Four weeks after immunization with vaccinia each of the pigs was challenged by infection with 5×10⁷ TCID₅₀ HCV Alfort. Virus was applicated oronasal according to the natural route of infection. This amount of virus has been experimentally determined to be compulsory lethal for pigs.

On day five after the challenge infection pig 28 revealed fever of 41.5° C. and kept this temperature until day 12. The moribund animal was killed that day expressing typical clinical signs of acute hog cholera.

Both pigs (26, 27) immunized with VAC3.8 did not show any sign of illness after the challenge with HCV for more than 14 days.

EXAMPLE 6 Construction of a 55 kD protein expression vector

A. PRV vector.

Clone pHCK11 is digested with restriction enzymes SacI and HpaI according to standard techniques.

The resulting 1.3 kb fragment, located between nucleotides 2672 (AGCTC) and 3971 (GTT) comprising most of HCV 55 kD protein, is isolated and cloned into the pseudorabies virus (PRV) gX gene (Maniatis et al. 1982).

Briefly, the cloned gX sequence was digested with SacI and ApaI. The ApaI 5' protruding ends were made blunt by filling up with Klenow fragment. After ligation the putative gX leader peptide coding sequence was located just upstream of the inserted HCV 55 kD sequence.

A translational stop codon downstream the HCV sequence was introduced by digestion with Bgl II (Bgl II site: 3936-3941) and religation after filling up the overhangs with Klenow fragment. This construct was placed downstream of the PRV gX promotor (clone 16/4-1.3). Clone 16/4-1.3 was transfected into MDBK cells by the DEAE dextran method (Maniatis et al. 1989). 16 h. later cells were infected with PRV (m.o.i.=1). 4 h. post infection cells were fixed with a mixture of cold (-20° C.) methanol/acetone. Indirect immunofluorescence with monoclonal antibodies (MABs) anti-HCV 55 kD protein revealed a specific signal in 5-10% of the cells. PRV infected cells without transfection and cells only transfected with clone 16/4-1.3 did not show any signal in this assay.

B. Vaccinia vector.

Clone pHCK11 is digested with restriction enzymes NheI and HpaI according to standard techniques. NheI 5' protruding end was made blunt by treatment with mung bean nuclease. The resulting 1.5 kb fragment, located between nucleotides 2438 (C) and 3971 (GTT) comprising HCV 55 kD protein, is isolated and cloned into the pseudorabies virus (PRV) gx gene (Maniatis et al., 1989).

The cloned gx sequence was digested with SacI and ApaI. SacI and ApaI 3' protruding ends were made blunt by exonuclease treatment with Klenow fragment. After ligation the putative gx leader peptide coding sequence was located upstream of the inserted HCV 55 kD sequence. A translational stop codon downstream the HCV sequence was introduced by digestion with BglII (BglII site 3936-3941) and religation after filling up the overhangs with Klenow fragment. This construct was isolated by digestion with estriction enzymes AviII and ScaI. Vaccinia recombination plasmid pGS62A (Cranage et al.; 1986) is digested with SmaI. The HCV coding sequence with gx leader sequence is ligated into the SmaI site of pGS62A. CVI-cells were infected with wild type Vaccinia strain WR and transfected with pGS62A containing gp 55 coding sequences. (Macket et al., 1984) Recombinant Vaccinia viruses expressing HCV gp55 were isolated.

Metabolic labeling of CVI cells infected with the Vaccinia recombinant virus containing the HCV gp55 gene was performed. HCV gp55 was detected after radio-immuno precipitation with HCV neutralizing monoclonal antibodies, SDS-PAGE and fluorography. Under nonreducing conditions for SDS-PAGE, the disulfide linked HCV gp55 homodimer (apparent molecular weight of about 100 kD) was observed. The migration characteristics were the same as for HCV gp55 precipitated from HCV infected cells.

EXAMPLE 7 Construction of a 44/48 kD protein expression vector

Clone pHCK11 is digested with restriction enzymes BglI and BanI according to standard techniques. The resulting 0.7 kb fragment, located between nucleotide 1115 (TGTTGGC) and 1838 (GTGC) comprising the HCV 44/48 kD protein, is isolated and ligated to synthetic adaptors connecting the 5'BglI restriction site with the BamHI site of the vaccinia recombination vector pGS62A and the 3' BanI site with the EcoRI site of the vaccinia recombination vector. The sequence of the 5'adaptor is (SEQ ID NO: 8 and 9).

    ______________________________________            5'-GATCCACCATGGGGGCCCTGT-3'                   GTGGTACCCCCGGG     ______________________________________

The sequence of the 3'adaptor is (SEQ ID NO: 10 and 11)

    ______________________________________             5'-GTGCCTATGCCTGAG-3'                    GATACGGACTCTTAA     ______________________________________

CVI-cells were infected with wild type Vaccinia strain WR and transfected with pGS62A containing the gp 44/48 coding sequences. Recombinant Vaccinia viruses expressing HCV gp 44/48 were isolated.

Metabolic labeling of CVI cells infected with the Vaccinia recombinant virus containing the HCV gp 44/48 gene was performed. HCV gp 44/48 was detected after radio-immuno precipitation with monoclonal antibodies, SDS-PAGE and fluorography. Under nonreducing conditions for SDS-PAGE, the disulfide linked HCV gp 44/48 homodimer (apparent molecular weight of about 100 kD) was observed. The migration characteristics were the same as for HCV gp 44/48 precipitated from HCV infected cells. It was demonstrated that the monoclonal antibodies which precipitated gp 44/48 from cells (infected with the Vaccinia recombinant neutralize HCV.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 displays physical maps of different HCV derived cDNA clones and their position relative to the RNA genome (upper line). Two HCV derived cDNA clones isolated after screening with either the antibody probe (0.8 kb clone) or the degenerated oligonucleotide probe (0.75 kb clone) are shown in the second line. The cDNA fragments chosen for nucleotide sequencing are indicated below. All numbers represent sizes of DNA fragments in kb. Restriction sites: B=Bgl II; E=EcoRI; H=Hind III; K=Kpn I; S=Sal I; Sm=Sma I.

FIG. 2 shows the length of the HCV DNA cloned in the pGS62 vector. A set of 15 deletion mutants derived from cDNA clone pHCK11 was established by treatment with Exonuclease III and cloned in the pGS62 vector giving rise to pGS62-3.8Exo 1-15. 3' end nucleotides are indicated.

REFERENCES

BENTON, W., and DAVIS, R. (1977). Screening λgt recombinant clones by hybridization to single plaques in sity. Science 196, 180-182.

CHIRGWIN, J. M., PRZYBYLA, A. E., MACDONALD, R. J., and RUTTER, W. J. (1979). Isolation of biologically active ribonucleic acid from sources enriched in ribonuclease, Biochemistry 18, 5294-5299.

CRANAGE, M. P. et al. (1986). EMBO, J. 5, 3057-3063.

DAVIS, L. G., DIBNER, M. D., and BATTEY, J. F. (1986). Basic Methods in Molecular Biology, 190-191. Elsevier, New York, Amsterdam, London.

DENTE, L., SOLLAZZO, M., BALDARI, C., CESARENI, G., and CORTESE, R. (1985). The pEMBL family of single-stranded vectors. In: DNA Cloning, Vol. 1, (Glover, D. M., ed.), IRL Press Oxford/Washington DC, pp. 101-107.

GEYSEN et al., H. M. (1987) J. Immunol. Meth. 102, 259-274.

HENNIKOFF, S. (1987). Unidirectional digestion with exonuclease III in DNA sequence analysis. In: Meth. Enzymol. (Wu, R., ed.) 155, 156-165.

HUYNH, T. V., YOUNG, T. A., and DAVIS, R. W. (1985). Constructing and screening cDNA libraries in λgt10 and λgt11. In: DNA Cloning: A Practical Approach, Vol. 2, (Glover, D. M., ed.), IRL Press Oxford, pp. 49-78.

KEIL, G. M., EBELING-KEIL, A., and KOSZINOWSKI, U. H. (1984). Temporal regulation of murine cytomegalovirus transcription and mapping of viral RNA synthesized at immediate early times after infection, J. Virol. 50, 784-795.

MACKETT, M. et al. (1984) J. Virol. 49, 857-864.

MANIATIS, T., FRITSCH, E. F., and SAMBROOKS, S. (1982). Molecular Cloning, a Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.

SANGER, F., NICKLEN, S., and COULSON, A. R. (1977). DNA sequencing with chain-terminating inhibitors. Proc. Natl. Acad. Sci. U.S.A. 74, 5363-5467.

MANIATIS, T. et al. (1989). Molecular Cloning, a Laboratory Manual, second edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.

STREBEL, K. et al. (1986). J. Virology 57, 983-991.

RuMENOPF, T. et al. (1989). Virology 171, 18-27.

YOUNG, R. A., and DAVIS, R. W. (1983). Efficient isolation of genes by using antibody probes. Proc. Natl. Acad. Sci. U.S.A. 80, 1194-1198.

    __________________________________________________________________________     #             SEQUENCE LISTING     - (1) GENERAL INFORMATION:     -    (iii) NUMBER OF SEQUENCES: 13     - (2) INFORMATION FOR SEQ ID NO:1:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 12284 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -     (vi) ORIGINAL SOURCE:               (A) ORGANISM: Hog chole - #ra virus               (B) STRAIN: Alfort               (H) CELL LINE: PK 15 - #  and 38A1D     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 364..12060     #/label= 435.sub.-- kDA.sub.-- protein     -     (ix) FEATURE:               (A) NAME/KEY: primer.sub.-- - #bind     #(2587..2619) LOCATION: complement     #/label= primer.sub.-- 1RMATION:     -     (ix) FEATURE:               (A) NAME/KEY: primer.sub.-- - #bind     #(2842..2880) LOCATION: complement     #/label= primer.sub.-- 2RMATION:     -     (ix) FEATURE:               (A) NAME/KEY: variation               (B) LOCATION: replace(127, - # "c")     -     (ix) FEATURE:               (A) NAME/KEY: variation               (B) LOCATION: replace(1522 - #, "g")     -     (ix) FEATURE:               (A) NAME/KEY: variation               (B) LOCATION: replace(1098 - #9, "t")     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:     - GTTAGCTCTT TCTCGTATAC GATATTGGAT ACACTAAATT TCGATTTGGT CT - #AGGGCACC       60     - CCTCCAGCGA CGGCCGAAAT GGGCTAGCCA TGCCCATAGT AGGACTAGCA AA - #CGGAGGGA      120     - CTAGCCGTAG TGGCGAGCTC CCTGGGTGGT CTAAGTCCTG AGTACAGGAC AG - #TCGTCAGT      180     - AGTTCGACGT GAGCACTAGC CCACCTCGAG ATGCTACGTG GACGAGGGCA TG - #CCCAAGAC      240     - ACACCTTAAC CCTGGCGGGG GTCGCTAGGG TGAAATCACA TTATGTGATG GG - #GGTACGAC      300     - CTGATAGGGT GCTGCAGAGG CCCACTAGCA GGCTAGTATA AAAATCTCTG CT - #GTACATGG      360     - CAC ATG GAG TTG AAT CAT TTT GAA TTA TTA TA - #C AAA ACA AGC AAA CAA      408     #Leu Tyr Lys Thr Ser Lys GlnGlu Leu     #   15     - AAA CCA GTG GGA GTG GAG GAA CCG GTG TAT GA - #C ACC GCG GGG AGA CCA      456     Lys Pro Val Gly Val Glu Glu Pro Val Tyr As - #p Thr Ala Gly Arg Pro     #                 30     - CTA TTT GGG AAC CCA AGT GAG GTA CAC CCA CA - #A TCA ACG CTG AAG CTG      504     Leu Phe Gly Asn Pro Ser Glu Val His Pro Gl - #n Ser Thr Leu Lys Leu     #             45     - CCA CAC GAC AGG GGG AGA GGA GAT ATC AGA AC - #A ACA CTG AGG GAC CTA      552     Pro His Asp Arg Gly Arg Gly Asp Ile Arg Th - #r Thr Leu Arg Asp Leu     #         60     - CCC AGG AAA GGT GAC TGT AGG AGT GGC AAC CA - #T CTA GGC CCG GTT AGT      600     Pro Arg Lys Gly Asp Cys Arg Ser Gly Asn Hi - #s Leu Gly Pro Val Ser     #     75     - GGG ATA TAC ATA AAG CCC GGC CCT GTC TAC TA - #T CAG GAC TAC ACG GGC      648     Gly Ile Tyr Ile Lys Pro Gly Pro Val Tyr Ty - #r Gln Asp Tyr Thr Gly     # 95     - CCA GTC TAT CAC AGA GCT CCT TTA GAG TTC TT - #T GAT GAG GCC CAG TTC      696     Pro Val Tyr His Arg Ala Pro Leu Glu Phe Ph - #e Asp Glu Ala Gln Phe     #               110     - TGC GAG GTG ACT AAG AGA ATA GGC AGG GTC AC - #G GGT AGT GAT GGT AAG      744     Cys Glu Val Thr Lys Arg Ile Gly Arg Val Th - #r Gly Ser Asp Gly Lys     #           125     - CTT TAC CAC ATA TAT GTG TGC GTC GAT GGT TG - #C ATA CTG CTG AAA TTA      792     Leu Tyr His Ile Tyr Val Cys Val Asp Gly Cy - #s Ile Leu Leu Lys Leu     #       140     - GCC AAA AGG GGC ACA CCC AGA ACC CTA AAG TG - #G ATT AGG AAC TTC ACC      840     Ala Lys Arg Gly Thr Pro Arg Thr Leu Lys Tr - #p Ile Arg Asn Phe Thr     #   155     - AAC TGT CCA TTA TGG GTA ACC AGT TGC TCC GA - #T GAC GGC GCA AGT GGC      888     Asn Cys Pro Leu Trp Val Thr Ser Cys Ser As - #p Asp Gly Ala Ser Gly     160                 1 - #65                 1 - #70                 1 -     #75     - AGC AAG GAT AAG AAG CCA GAC AGA ATG AAC AA - #A GGT AAG TTG AAG ATA      936     Ser Lys Asp Lys Lys Pro Asp Arg Met Asn Ly - #s Gly Lys Leu Lys Ile     #               190     - GCC CCA AGA GAG CAT GAG AAG GAC AGC AAG AC - #C AAG CCT CCT GAT GCA      984     Ala Pro Arg Glu His Glu Lys Asp Ser Lys Th - #r Lys Pro Pro Asp Ala     #           205     - ACG ATT GTA GTA GAG GGA GTA AAA TAC CAA AT - #C AAA AAG AAA GGC AAA     1032     Thr Ile Val Val Glu Gly Val Lys Tyr Gln Il - #e Lys Lys Lys Gly Lys     #       220     - GTC AAA GGG AAG AAC ACA CAA GAC GGC CTG TA - #C CAT AAT AAG AAC AAG     1080     Val Lys Gly Lys Asn Thr Gln Asp Gly Leu Ty - #r His Asn Lys Asn Lys     #   235     - CCA CCA GAG TCC AGG AAG AAA CTA GAA AAA GC - #C CTG TTG GCT TGG GCG     1128     Pro Pro Glu Ser Arg Lys Lys Leu Glu Lys Al - #a Leu Leu Ala Trp Ala     240                 2 - #45                 2 - #50                 2 -     #55     - GTG ATA ACA ATC TTG CTG TAC CAG CCT GTA GC - #A GCC GAG AAC ATA ACT     1176     Val Ile Thr Ile Leu Leu Tyr Gln Pro Val Al - #a Ala Glu Asn Ile Thr     #               270     - CAA TGG AAC CTG AGT GAC AAC GGC ACT AAT GG - #T ATT CAG CGA GCC ATG     1224     Gln Trp Asn Leu Ser Asp Asn Gly Thr Asn Gl - #y Ile Gln Arg Ala Met     #           285     - TAT CTT AGA GGG GTT AAC AGG AGC TTA CAT GG - #G ATC TGG CCC GAG AAA     1272     Tyr Leu Arg Gly Val Asn Arg Ser Leu His Gl - #y Ile Trp Pro Glu Lys     #       300     - ATA TGC AAG GGG GTC CCC ACT CAT CTG GCC AC - #T GAC ACG GAA CTG AAA     1320     Ile Cys Lys Gly Val Pro Thr His Leu Ala Th - #r Asp Thr Glu Leu Lys     #   315     - GAG ATA CGC GGG ATG ATG GAT GCC AGC GAG AG - #G ACA AAC TAT ACG TGC     1368     Glu Ile Arg Gly Met Met Asp Ala Ser Glu Ar - #g Thr Asn Tyr Thr Cys     320                 3 - #25                 3 - #30                 3 -     #35     - TGT AGG TTA CAA AGA CAT GAA TGG AAC AAA CA - #T GGA TGG TGT AAC TGG     1416     Cys Arg Leu Gln Arg His Glu Trp Asn Lys Hi - #s Gly Trp Cys Asn Trp     #               350     - TAC AAC ATA GAC CCT TGG ATT CAG TTA ATG AA - #C AGG ACC CAA ACA AAT     1464     Tyr Asn Ile Asp Pro Trp Ile Gln Leu Met As - #n Arg Thr Gln Thr Asn     #           365     - TTG ACA GAA GGC CCT CCA GAT AAG GAG TGT GC - #C GTG ACC TGC AGG TAT     1512     Leu Thr Glu Gly Pro Pro Asp Lys Glu Cys Al - #a Val Thr Cys Arg Tyr     #       380     - GAC AAA AAT ACC GAT GTC AAC GTG GTC ACC CA - #G GCC AGG AAT AGG CCA     1560     Asp Lys Asn Thr Asp Val Asn Val Val Thr Gl - #n Ala Arg Asn Arg Pro     #   395     - ACT ACT CTG ACT GGC TGC AAG AAA GGG AAA AA - #C TTT TCA TTC GCA GGC     1608     Thr Thr Leu Thr Gly Cys Lys Lys Gly Lys As - #n Phe Ser Phe Ala Gly     400                 4 - #05                 4 - #10                 4 -     #15     - ACA GTC ATA GAG GGC CCG TGC AAT TTC AAC GT - #T TCC GTG GAG GAC ATC     1656     Thr Val Ile Glu Gly Pro Cys Asn Phe Asn Va - #l Ser Val Glu Asp Ile     #               430     - TTA TAC GGA GAC CAT GAG TGT GGC AGT CTG CT - #C CAG GAC ACG GCT CTG     1704     Leu Tyr Gly Asp His Glu Cys Gly Ser Leu Le - #u Gln Asp Thr Ala Leu     #           445     - TAC CTA TTG GAT GGA ATG ACC AAC ACT ATA GA - #G AAT GCC AGG CAA GGT     1752     Tyr Leu Leu Asp Gly Met Thr Asn Thr Ile Gl - #u Asn Ala Arg Gln Gly     #       460     - GCG GCG CGG GTG ACA TCT TGG CTT GGG AGG CA - #G CTC AGT ACC GCA GGG     1800     Ala Ala Arg Val Thr Ser Trp Leu Gly Arg Gl - #n Leu Ser Thr Ala Gly     #   475     - AAG AAG CTA GAG AGG AGA AGC AAA ACC TGG TT - #T GGT GCC TAT GCC CTG     1848     Lys Lys Leu Glu Arg Arg Ser Lys Thr Trp Ph - #e Gly Ala Tyr Ala Leu     480                 4 - #85                 4 - #90                 4 -     #95     - TCA CCT TAC TGC AAT GTG ACT AGA AAA ATA GG - #G TAC ATA TGG TAT ACA     1896     Ser Pro Tyr Cys Asn Val Thr Arg Lys Ile Gl - #y Tyr Ile Trp Tyr Thr     #               510     - AAC AAC TGC ACC CCG GCA TGC CTC CCT AAG AA - #C ACA AAA ATA ATA GGC     1944     Asn Asn Cys Thr Pro Ala Cys Leu Pro Lys As - #n Thr Lys Ile Ile Gly     #           525     - CCT GGA AAG TTT GAC ACC AAT GCG GAA GAC GG - #G AAG ATC CTT CAT GAA     1992     Pro Gly Lys Phe Asp Thr Asn Ala Glu Asp Gl - #y Lys Ile Leu His Glu     #       540     - ATG GGG GGC CAC CTA TCA GAA TTT TTG TTG CT - #T TCT CTA GTT ATC CTG     2040     Met Gly Gly His Leu Ser Glu Phe Leu Leu Le - #u Ser Leu Val Ile Leu     #   555     - TCT GAC TTT GCC CCC GAG ACA GCT AGC ACG CT - #A TAC CTA ATT TTA CAC     2088     Ser Asp Phe Ala Pro Glu Thr Ala Ser Thr Le - #u Tyr Leu Ile Leu His     560                 5 - #65                 5 - #70                 5 -     #75     - TAT GCA ATC CCC CAG TCC CAC GAA GAA CCT GA - #A GGT TGT GAT ACG AAC     2136     Tyr Ala Ile Pro Gln Ser His Glu Glu Pro Gl - #u Gly Cys Asp Thr Asn     #               590     - CAA CTT AAC CTA ACA GTG AAA CTT AGG ACA GA - #A GAC GTA GTG CCA TCA     2184     Gln Leu Asn Leu Thr Val Lys Leu Arg Thr Gl - #u Asp Val Val Pro Ser     #           605     - TCA GTT TGG AAT ATT GGC AAA TAT GTT TGT GT - #T AGA CCA GAC TGG TGG     2232     Ser Val Trp Asn Ile Gly Lys Tyr Val Cys Va - #l Arg Pro Asp Trp Trp     #       620     - CCG TAT GAA ACT AAA GTG GCT CTG CTG TTT GA - #A GAG GCA GGA CAG GTT     2280     Pro Tyr Glu Thr Lys Val Ala Leu Leu Phe Gl - #u Glu Ala Gly Gln Val     #   635     - ATA AAG CTA GTC CTA CGG GCA CTG AGG GAT TT - #A ACT AGG GTC TGG AAC     2328     Ile Lys Leu Val Leu Arg Ala Leu Arg Asp Le - #u Thr Arg Val Trp Asn     640                 6 - #45                 6 - #50                 6 -     #55     - AGC GCA TCA ACT ACT GCG TTT CTC ATT TGC TT - #G ATA AAA GTA TTG AGA     2376     Ser Ala Ser Thr Thr Ala Phe Leu Ile Cys Le - #u Ile Lys Val Leu Arg     #               670     - GGA CAG GTT GTG CAA GGT ATA ATA TGG CTG CT - #G CTG GTG ACC GGG GCA     2424     Gly Gln Val Val Gln Gly Ile Ile Trp Leu Le - #u Leu Val Thr Gly Ala     #           685     - CAA GGG CGG CTA GCC TGT AAG GAA GAC TAC AG - #G TAT GCG ATC TCG TCA     2472     Gln Gly Arg Leu Ala Cys Lys Glu Asp Tyr Ar - #g Tyr Ala Ile Ser Ser     #       700     - ACC AAT GAG ATA GGG CTG CTG GGC GCT GAA GG - #T CTC ACC ACT ACC TGG     2520     Thr Asn Glu Ile Gly Leu Leu Gly Ala Glu Gl - #y Leu Thr Thr Thr Trp     #   715     - AAA GAA TAC AGC CAC GGT TTG CAG CTG GAC GA - #C GGA ACC GTT AAG GCC     2568     Lys Glu Tyr Ser His Gly Leu Gln Leu Asp As - #p Gly Thr Val Lys Ala     720                 7 - #25                 7 - #30                 7 -     #35     - GTC TGC ACT GCA GGG TCC TTT AAA GTC ACA GC - #A CTT AAC GTG GTT AGT     2616     Val Cys Thr Ala Gly Ser Phe Lys Val Thr Al - #a Leu Asn Val Val Ser     #               750     - AGG AGG TAT CTA GCA TCA TTG CAC AAG AGG GC - #T CTA CCC ACC TCA GTG     2664     Arg Arg Tyr Leu Ala Ser Leu His Lys Arg Al - #a Leu Pro Thr Ser Val     #           765     - ACA TTT GAG CTC CTA TTT GAC GGG ACC AAC CC - #A GCA ATC GAG GAG ATG     2712     Thr Phe Glu Leu Leu Phe Asp Gly Thr Asn Pr - #o Ala Ile Glu Glu Met     #       780     - GAT GAT GAC TTC GGA TTT GGG CTG TGC CCA TT - #T GAC ACG AGT CCT GTG     2760     Asp Asp Asp Phe Gly Phe Gly Leu Cys Pro Ph - #e Asp Thr Ser Pro Val     #   795     - ATC AAA GGG AAG TAC AAC ACC ACT TTG TTA AA - #C GGC AGT GCT TTC TAT     2808     Ile Lys Gly Lys Tyr Asn Thr Thr Leu Leu As - #n Gly Ser Ala Phe Tyr     800                 8 - #05                 8 - #10                 8 -     #15     - CTA GTC TGC CCA ATA GGA TGG ACT GGT GTC GT - #A GAG TGC ACA GCA GTG     2856     Leu Val Cys Pro Ile Gly Trp Thr Gly Val Va - #l Glu Cys Thr Ala Val     #               830     - AGC CCC ACA ACC TTG AGA ACA GAA GTG GTG AA - #A ACC TTC AGG AGA GAT     2904     Ser Pro Thr Thr Leu Arg Thr Glu Val Val Ly - #s Thr Phe Arg Arg Asp     #           845     - AAG CCT TTT CCA CAT AGA GTA GAC TGT GTG AC - #C ACC ATA GTA GAA AAA     2952     Lys Pro Phe Pro His Arg Val Asp Cys Val Th - #r Thr Ile Val Glu Lys     #       860     - GAA GAC CTA TTC CAT TGC AAG TTG GGG GGT AA - #T TGG ACA TGT GTA AAA     3000     Glu Asp Leu Phe His Cys Lys Leu Gly Gly As - #n Trp Thr Cys Val Lys     #   875     - GGC GAC CCA GTG ACT TAT AAG GGG GGG CAA GT - #A AAG CAG TGC AGG TGG     3048     Gly Asp Pro Val Thr Tyr Lys Gly Gly Gln Va - #l Lys Gln Cys Arg Trp     880                 8 - #85                 8 - #90                 8 -     #95     - TGT GGT TTC GAG TTT AAA GAG CCC TAC GGG CT - #C CCA CAC TAC CCT ATA     3096     Cys Gly Phe Glu Phe Lys Glu Pro Tyr Gly Le - #u Pro His Tyr Pro Ile     #               910     - GGC AAG TGC ATC CTA ACA AAT GAG ACA GGT TA - #C AGG GTA GTA GAT TCC     3144     Gly Lys Cys Ile Leu Thr Asn Glu Thr Gly Ty - #r Arg Val Val Asp Ser     #           925     - ACA GAC TGC AAC AGA GAT GGC GTC GTT ATT AG - #C ACT GAA GGG GAA CAT     3192     Thr Asp Cys Asn Arg Asp Gly Val Val Ile Se - #r Thr Glu Gly Glu His     #       940     - GAG TGC TTG ATT GGC AAC ACT ACC GTC AAG GT - #G CAT GCA CTG GAT GAA     3240     Glu Cys Leu Ile Gly Asn Thr Thr Val Lys Va - #l His Ala Leu Asp Glu     #   955     - AGA TTG GGC CCT ATG CCG TGC AGA CCC AAA GA - #A ATC GTC TCT AGT GAG     3288     Arg Leu Gly Pro Met Pro Cys Arg Pro Lys Gl - #u Ile Val Ser Ser Glu     960                 9 - #65                 9 - #70                 9 -     #75     - GGA CCT GTG AGG AAA ACT TCT TGT ACA TTC AA - #C TAC ACA AAG ACT CTA     3336     Gly Pro Val Arg Lys Thr Ser Cys Thr Phe As - #n Tyr Thr Lys Thr Leu     #               990     - AGA AAC AAA TAC TAT GAG CCC AGA GAC AGT TA - #C TTC CAG CAA TAT ATG     3384     Arg Asn Lys Tyr Tyr Glu Pro Arg Asp Ser Ty - #r Phe Gln Gln Tyr Met     #          10050     - CTC AAG GGC GAG TAT CAA TAC TGG TTT AAT CT - #G GAC GTG ACC GAC CAC     3432     Leu Lys Gly Glu Tyr Gln Tyr Trp Phe Asn Le - #u Asp Val Thr Asp His     #      10205     - CAC ACA GAC TAC TTT GCC GAG TTT GTT GTC TT - #G GTA GTA GTA GCA CTG     3480     His Thr Asp Tyr Phe Ala Glu Phe Val Val Le - #u Val Val Val Ala Leu     #  10350     - TTA GGA GGA AGG TAC GTT CTG TGG CTA ATA GT - #G ACC TAC ATA ATT CTA     3528     Leu Gly Gly Arg Tyr Val Leu Trp Leu Ile Va - #l Thr Tyr Ile Ile Leu     #               10551045 - #                1050     - ACA GAG CAG CTC GCT GCT GGT CTA CAG CTA GG - #C CAG GGT GAG GTG GTA     3576     Thr Glu Gln Leu Ala Ala Gly Leu Gln Leu Gl - #y Gln Gly Glu Val Val     #              10705     - TTG ATA GGG AAC CTA ATT ACC CAC ACG GAC AA - #T GAG GTG GTG GTG TAC     3624     Leu Ile Gly Asn Leu Ile Thr His Thr Asp As - #n Glu Val Val Val Tyr     #          10850     - TTC CTA CTG CTC TAC TTA GTA ATA AGA GAT GA - #G CCC ATA AAG AAA TGG     3672     Phe Leu Leu Leu Tyr Leu Val Ile Arg Asp Gl - #u Pro Ile Lys Lys Trp     #      11005     - ATA CTA CTG CTG TTT CAT GCA ATG ACT AAC AA - #T CCA GTC AAG ACC ATA     3720     Ile Leu Leu Leu Phe His Ala Met Thr Asn As - #n Pro Val Lys Thr Ile     #  11150     - ACA GTA GCA TTG CTA ATG ATC AGT GGG GTT GC - #C AAG GGT GGT AAG ATA     3768     Thr Val Ala Leu Leu Met Ile Ser Gly Val Al - #a Lys Gly Gly Lys Ile     #               11351125 - #                1130     - GAT GGT GGC TGG CAG AGA CAA CCG GTG ACC AG - #T TTT GAC ATC CAA CTC     3816     Asp Gly Gly Trp Gln Arg Gln Pro Val Thr Se - #r Phe Asp Ile Gln Leu     #              11505     - GCA CTG GCA GTC GTA GTA GTC GTT GTG ATG TT - #G CTG GCA AAG AGA GAC     3864     Ala Leu Ala Val Val Val Val Val Val Met Le - #u Leu Ala Lys Arg Asp     #          11650     - CCG ACT ACT TTC CCT TTG GTA ATC ACA GTG GC - #A ACC CTG AGA ACG GCC     3912     Pro Thr Thr Phe Pro Leu Val Ile Thr Val Al - #a Thr Leu Arg Thr Ala     #      11805     - AAG ATA ACC AAC GGT TTT AGC ACA GAT CTA GT - #C ATA GCC ACA GTG TCG     3960     Lys Ile Thr Asn Gly Phe Ser Thr Asp Leu Va - #l Ile Ala Thr Val Ser     #  11950     - GCA GCT TTG TTA ACT TGG ACC TAT ATC AGC GA - #C TAC TAC AAA TAC AAG     4008     Ala Ala Leu Leu Thr Trp Thr Tyr Ile Ser As - #p Tyr Tyr Lys Tyr Lys     #               12151205 - #                1210     - ACT TGG CTA CAG TAC CTC GTC AGC ACG GTG AC - #T GGA ATC TTC CTG ATA     4056     Thr Trp Leu Gln Tyr Leu Val Ser Thr Val Th - #r Gly Ile Phe Leu Ile     #              12305     - AGG GTG CTG AAG GGA ATA GGC GAA TTG GAT CT - #G CAC GCC CCA ACC TTG     4104     Arg Val Leu Lys Gly Ile Gly Glu Leu Asp Le - #u His Ala Pro Thr Leu     #          12450     - CCG TCT CAC AGA CCC CTC TTT TAC ATC CTT GT - #A TAC CTT ATT TCC ACT     4152     Pro Ser His Arg Pro Leu Phe Tyr Ile Leu Va - #l Tyr Leu Ile Ser Thr     #      12605     - GCC GTG GTA ACT AGA TGG AAT CTG GAC GTA GC - #C GGA TTG TTG CTG CAG     4200     Ala Val Val Thr Arg Trp Asn Leu Asp Val Al - #a Gly Leu Leu Leu Gln     #  12750     - TGC GTC CCA ACT CTT TTA ATG GTT TTT ACG AT - #G TGG GCA GAC ATT CTC     4248     Cys Val Pro Thr Leu Leu Met Val Phe Thr Me - #t Trp Ala Asp Ile Leu     #               12951285 - #                1290     - ACC CTA ATT CTC ATA CTA CCT ACT TAT GAG TT - #A ACA AAG TTA TAC TAC     4296     Thr Leu Ile Leu Ile Leu Pro Thr Tyr Glu Le - #u Thr Lys Leu Tyr Tyr     #              13105     - CTT AAG GAA GTG AAG ATT GGG GCA GAA AGA GG - #T TGG CTG TGG AAA ACT     4344     Leu Lys Glu Val Lys Ile Gly Ala Glu Arg Gl - #y Trp Leu Trp Lys Thr     #          13250     - AAC TAT AAG AGG GTA AAC GAC ATC TAC GAG GT - #C GAC CAA ACT AGC GAA     4392     Asn Tyr Lys Arg Val Asn Asp Ile Tyr Glu Va - #l Asp Gln Thr Ser Glu     #      13405     - GGG GTT TAC CTT TTC CCT TCT AAA CAG AGG AC - #G AGC GCT ATA ACT AGT     4440     Gly Val Tyr Leu Phe Pro Ser Lys Gln Arg Th - #r Ser Ala Ile Thr Ser     #  13550     - ACC ATG TTG CCA TTA ATC AAA GCC ATA CTC AT - #T AGC TGC ATC AGC AAC     4488     Thr Met Leu Pro Leu Ile Lys Ala Ile Leu Il - #e Ser Cys Ile Ser Asn     #               13751365 - #                1370     - AAG TGG CAA CTC ATA TAC TTA CTG TAC TTG AT - #A TTT GAA GTG TCT TAC     4536     Lys Trp Gln Leu Ile Tyr Leu Leu Tyr Leu Il - #e Phe Glu Val Ser Tyr     #              13905     - TAC CTC CAC AAG AAA GTT ATA GAT GAA ATA GC - #T GGT GGG ACC AAC TTC     4584     Tyr Leu His Lys Lys Val Ile Asp Glu Ile Al - #a Gly Gly Thr Asn Phe     #          14050     - GTT TCA AGG CTC GTG GCG GCT TTG ATT GAA GT - #C AAT TGG GCC TTC GAC     4632     Val Ser Arg Leu Val Ala Ala Leu Ile Glu Va - #l Asn Trp Ala Phe Asp     #      14205     - AAT GAA GAA GTC AAA GGC TTA AAG AAG TTC TT - #C TTG CTG TCT AGT AGG     4680     Asn Glu Glu Val Lys Gly Leu Lys Lys Phe Ph - #e Leu Leu Ser Ser Arg     #  14350     - GTC AAA GAG TTG ATC ATC AAA CAC AAA GTG AG - #G AAT GAA GTA GTG GTC     4728     Val Lys Glu Leu Ile Ile Lys His Lys Val Ar - #g Asn Glu Val Val Val     #               14551445 - #                1450     - CGC TGG TTT GGA GAT GAA GAG ATT TAT GGG AT - #G CCA AAG CTG ATC GGC     4776     Arg Trp Phe Gly Asp Glu Glu Ile Tyr Gly Me - #t Pro Lys Leu Ile Gly     #              14705     - TTA GTT AAG GCA GCA ACA CTA AGT AGA AAC AA - #A CAC TGT ATG TTG TGT     4824     Leu Val Lys Ala Ala Thr Leu Ser Arg Asn Ly - #s His Cys Met Leu Cys     #          14850     - ACC GTC TGT GAG GAC AGA GAT TGG AGA GGG GA - #A ACT TGC CCT AAA TGT     4872     Thr Val Cys Glu Asp Arg Asp Trp Arg Gly Gl - #u Thr Cys Pro Lys Cys     #      15005     - GGG CGT TTT GGA CCA CCA GTG GTC TGC GGT AT - #G ACC CTA GCC GAT TTC     4920     Gly Arg Phe Gly Pro Pro Val Val Cys Gly Me - #t Thr Leu Ala Asp Phe     #  15150     - GAA GAA AAA CAC TAT AAA AGG ATT TTC ATT AG - #A GAG GAC CAA TCA GGC     4968     Glu Glu Lys His Tyr Lys Arg Ile Phe Ile Ar - #g Glu Asp Gln Ser Gly     #               15351525 - #                1530     - GGG CCA CTT AGG GAG GAG CAT GCA GGG TAC TT - #G CAG TAC AAA GCC AGG     5016     Gly Pro Leu Arg Glu Glu His Ala Gly Tyr Le - #u Gln Tyr Lys Ala Arg     #              15505     - GGT CAA CTG TTT TTG AGG AAC CTC CCA GTG TT - #A GCT ACA AAA GTC AAG     5064     Gly Gln Leu Phe Leu Arg Asn Leu Pro Val Le - #u Ala Thr Lys Val Lys     #          15650     - ATG CTC CTG GTT GGT AAC CTC GGG ACA GAG AT - #T GGG GAT CTG GAA CAC     5112     Met Leu Leu Val Gly Asn Leu Gly Thr Glu Il - #e Gly Asp Leu Glu His     #      15805     - CTT GGC TGG GTG CTT AGA GGG CCA GCT GTT TG - #C AAG AAG GTT ACT GAA     5160     Leu Gly Trp Val Leu Arg Gly Pro Ala Val Cy - #s Lys Lys Val Thr Glu     #  15950     - CAC GAA AGA TGC ACC ACG TCT ATA ATG GAT AA - #G TTG ACT GCT TTC TTT     5208     His Glu Arg Cys Thr Thr Ser Ile Met Asp Ly - #s Leu Thr Ala Phe Phe     #               16151605 - #                1610     - GGA GTA ATG CCA AGG GGC ACT ACT CCC AGA GC - #T CCC GTA AGA TTC CCT     5256     Gly Val Met Pro Arg Gly Thr Thr Pro Arg Al - #a Pro Val Arg Phe Pro     #              16305     - ACC TCC CTC CTA AAG ATA AGA AGA GGG CTG GA - #G ACT GGT TGG GCT TAC     5304     Thr Ser Leu Leu Lys Ile Arg Arg Gly Leu Gl - #u Thr Gly Trp Ala Tyr     #          16450     - ACA CAC CAA GGT GGC ATC AGC TCA GTA GAC CA - #T GTC ACT TGT GGG AAA     5352     Thr His Gln Gly Gly Ile Ser Ser Val Asp Hi - #s Val Thr Cys Gly Lys     #      16605     - GAC TTA CTG GTG TGT GAC ACC ATG GGT CGG AC - #A AGG GTT GTT TGC CAG     5400     Asp Leu Leu Val Cys Asp Thr Met Gly Arg Th - #r Arg Val Val Cys Gln     #  16750     - TCA AAT AAT AAG ATG ACC GAC GAG TCC GAA TA - #C GGA GTC AAA ACT GAC     5448     Ser Asn Asn Lys Met Thr Asp Glu Ser Glu Ty - #r Gly Val Lys Thr Asp     #               16951685 - #                1690     - TCC GGG TGC CCA GAG GGA GCC AGG TGT TAC GT - #G TTT AAC CCG GAA GCA     5496     Ser Gly Cys Pro Glu Gly Ala Arg Cys Tyr Va - #l Phe Asn Pro Glu Ala     #              17105     - GTT AAC ATA TCA GGC ACT AAA GGA GCC ATG GT - #C CAC TTA CAG AAA ACG     5544     Val Asn Ile Ser Gly Thr Lys Gly Ala Met Va - #l His Leu Gln Lys Thr     #          17250     - GGT GGA GAA TTC ACC TGT GTG ACA GCA TCA GG - #A ACC CCG GCC TTC TTT     5592     Gly Gly Glu Phe Thr Cys Val Thr Ala Ser Gl - #y Thr Pro Ala Phe Phe     #      17405     - GAC CTC AAG AAC CTT AAG GGC TGG TCA GGG CT - #A CCG ATA TTT GAA GCA     5640     Asp Leu Lys Asn Leu Lys Gly Trp Ser Gly Le - #u Pro Ile Phe Glu Ala     #  17550     - TCA AGT GGA AGG GTA GTC GGA AGG GTC AAG GT - #C GGG AAG AAC GAG GAT     5688     Ser Ser Gly Arg Val Val Gly Arg Val Lys Va - #l Gly Lys Asn Glu Asp     #               17751765 - #                1770     - TCC AAA CCA ACC AAG CTC ATG AGT GGG ATA CA - #A ACG GTT TCT AAA AGC     5736     Ser Lys Pro Thr Lys Leu Met Ser Gly Ile Gl - #n Thr Val Ser Lys Ser     #              17905     - GCC ACA GAC TTG ACG GAG ATG GTG AAG AAG AT - #A ACG ACC ATG AAC AGG     5784     Ala Thr Asp Leu Thr Glu Met Val Lys Lys Il - #e Thr Thr Met Asn Arg     #          18050     - GGA GAG TTC AGA CAA ATA ACC CTG GCC ACA GG - #T GCC GGA AAA ACT ACA     5832     Gly Glu Phe Arg Gln Ile Thr Leu Ala Thr Gl - #y Ala Gly Lys Thr Thr     #      18205     - GAG CTC CCT AGA TCA GTT ATA GAA GAG ATA GG - #G AGG CAT AAG AGG GTG     5880     Glu Leu Pro Arg Ser Val Ile Glu Glu Ile Gl - #y Arg His Lys Arg Val     #  18350     - TTG GTC TTA ATC CCC TTG AGG GCG GCA GCA GA - #A TCA GTA TAC CAA TAC     5928     Leu Val Leu Ile Pro Leu Arg Ala Ala Ala Gl - #u Ser Val Tyr Gln Tyr     #               18551845 - #                1850     - ATG AGA CAG AAA CAT CCG AGT ATA GCA TTC AA - #T CTA AGG ATA GGT GAG     5976     Met Arg Gln Lys His Pro Ser Ile Ala Phe As - #n Leu Arg Ile Gly Glu     #              18705     - ATG AAG GAA GGT GAT ATG GCC ACG GGA ATA AC - #C TAT GCC TCT TAC GGT     6024     Met Lys Glu Gly Asp Met Ala Thr Gly Ile Th - #r Tyr Ala Ser Tyr Gly     #          18850     - TAC TTT TGC CAG ATG TCA CAA CCC AAG CTG AG - #A GCC GCA ATG GTA GAA     6072     Tyr Phe Cys Gln Met Ser Gln Pro Lys Leu Ar - #g Ala Ala Met Val Glu     #      19005     - TAT TCC TTT ATA TTC CTA GAT GAG TAT CAT TG - #T GCT ACC CCA GAA CAA     6120     Tyr Ser Phe Ile Phe Leu Asp Glu Tyr His Cy - #s Ala Thr Pro Glu Gln     #  19150     - CTG GCA ATC ATG GGG AAG ATC CAC AGA TTC TC - #A GAA AAC CTG CGG GTG     6168     Leu Ala Ile Met Gly Lys Ile His Arg Phe Se - #r Glu Asn Leu Arg Val     #               19351925 - #                1930     - GTA GCT ATG ACA GCG ACA CCG GCA GGC ACA GT - #A ACA ACC ACT GGG CAG     6216     Val Ala Met Thr Ala Thr Pro Ala Gly Thr Va - #l Thr Thr Thr Gly Gln     #              19505     - AAA CAC CCT ATA GAG GAA TTT ATA GCC CCG GA - #A GTG ATG AAA GGA GAA     6264     Lys His Pro Ile Glu Glu Phe Ile Ala Pro Gl - #u Val Met Lys Gly Glu     #          19650     - GAC TTG GGT TCT GAG TAC TTA GAT ATT GCC GG - #A CTG AAG ATA CCA GTA     6312     Asp Leu Gly Ser Glu Tyr Leu Asp Ile Ala Gl - #y Leu Lys Ile Pro Val     #      19805     - GAG GAG ATG AAG AAT AAC ATG CTA GTT TTT GT - #G CCC ACC AGG AAC ATG     6360     Glu Glu Met Lys Asn Asn Met Leu Val Phe Va - #l Pro Thr Arg Asn Met     #  19950     - GCG GTA GAG GCG GCA AAG AAA TTG AAG GCC AA - #A GGA TAC AAC TCG GGC     6408     Ala Val Glu Ala Ala Lys Lys Leu Lys Ala Ly - #s Gly Tyr Asn Ser Gly     #               20152005 - #                2010     - TAC TAC TAC AGC GGA GAG GAC CCA TCT AAC CT - #G AGG GTG GTG ACG TCG     6456     Tyr Tyr Tyr Ser Gly Glu Asp Pro Ser Asn Le - #u Arg Val Val Thr Ser     #              20305     - CAG TCC CCA TAC GTG GTG GTA GCA ACC AAC GC - #A ATA GAA TCG GGC GTT     6504     Gln Ser Pro Tyr Val Val Val Ala Thr Asn Al - #a Ile Glu Ser Gly Val     #          20450     - ACC CTC CCG GAC CTG GAC GTG GTT GTC GAC AC - #G GGA CTC AAG TGT GAA     6552     Thr Leu Pro Asp Leu Asp Val Val Val Asp Th - #r Gly Leu Lys Cys Glu     #      20605     - AAA AGA ATC CGA CTG TCA CCC AAG ATG CCT TT - #C ATA GTG ACG GGC CTG     6600     Lys Arg Ile Arg Leu Ser Pro Lys Met Pro Ph - #e Ile Val Thr Gly Leu     #  20750     - AAA AGA ATG GCC GTC ACT ATT GGG GAA CAA GC - #C CAG AGA AGA GGG AGG     6648     Lys Arg Met Ala Val Thr Ile Gly Glu Gln Al - #a Gln Arg Arg Gly Arg     #               20952085 - #                2090     - GTT GGA AGA GTG AAG CCC GGG AGA TAC TAC AG - #G AGT CAA GAA ACA CCT     6696     Val Gly Arg Val Lys Pro Gly Arg Tyr Tyr Ar - #g Ser Gln Glu Thr Pro     #              21105     - GTC GGC TCT AAA GAC TAC CAT TAT GAC TTA TT - #G CAA GCC CAG AGG TAC     6744     Val Gly Ser Lys Asp Tyr His Tyr Asp Leu Le - #u Gln Ala Gln Arg Tyr     #          21250     - GGC ATA GAA GAT GGG ATA AAT ATC ACC AAA TC - #C TTC AGA GAG ATG AAC     6792     Gly Ile Glu Asp Gly Ile Asn Ile Thr Lys Se - #r Phe Arg Glu Met Asn     #      21405     - TAC GAC TGG AGC CTT TAT GAG GAA GAT AGC CT - #G ATG ATC ACA CAA CTG     6840     Tyr Asp Trp Ser Leu Tyr Glu Glu Asp Ser Le - #u Met Ile Thr Gln Leu     #  21550     - GAA ATC CTC AAC AAC CTG TTG ATA TCA GAA GA - #G CTG CCG ATG GCA GTA     6888     Glu Ile Leu Asn Asn Leu Leu Ile Ser Glu Gl - #u Leu Pro Met Ala Val     #               21752165 - #                2170     - AAA AAT ATA ATG GCC AGG ACC GAC CAC CCA GA - #A CCA ATT CAA CTC GCG     6936     Lys Asn Ile Met Ala Arg Thr Asp His Pro Gl - #u Pro Ile Gln Leu Ala     #              21905     - TAT AAC AGC TAC GAG ACA CAG GTG CCG GTA TT - #A TTC CCA AAA ATA AGA     6984     Tyr Asn Ser Tyr Glu Thr Gln Val Pro Val Le - #u Phe Pro Lys Ile Arg     #          22050     - AAT GGA GAG GTG ACT GAT ACT TAC GAT AAT TA - #C ACC TTC CTC AAT GCA     7032     Asn Gly Glu Val Thr Asp Thr Tyr Asp Asn Ty - #r Thr Phe Leu Asn Ala     #      22205     - AGA AAA TTG GGA GAT GAC GTA CCC CCC TAC GT - #G TAT GCT ACA GAG GAT     7080     Arg Lys Leu Gly Asp Asp Val Pro Pro Tyr Va - #l Tyr Ala Thr Glu Asp     #  22350     - GAG GAC TTG GCA GTG GAA CTG TTG GGC CTA GA - #T TGG CCG GAC CCA GGA     7128     Glu Asp Leu Ala Val Glu Leu Leu Gly Leu As - #p Trp Pro Asp Pro Gly     #               22552245 - #                2250     - AAC CAA GGC ACC GTG GAA GCT GGC AGA GCA CT - #A AAA CAG GTG GTT GGT     7176     Asn Gln Gly Thr Val Glu Ala Gly Arg Ala Le - #u Lys Gln Val Val Gly     #              22705     - CTA TCA ACA GCA GAG AAC GCC CTG CTA GTC GC - #C CTG TTC GGC TAC GTG     7224     Leu Ser Thr Ala Glu Asn Ala Leu Leu Val Al - #a Leu Phe Gly Tyr Val     #          22850     - GGG TAC CAG GCG CTT TCA AAG AGA CAT ATA CC - #A GTG GTC ACA GAT ATA     7272     Gly Tyr Gln Ala Leu Ser Lys Arg His Ile Pr - #o Val Val Thr Asp Ile     #      23005     - TAT TCA GTA GAA GAT CAC AGG CTA GAG GAC AC - #T ACG CAC CTA CAG TAT     7320     Tyr Ser Val Glu Asp His Arg Leu Glu Asp Th - #r Thr His Leu Gln Tyr     #  23150     - GCT CCG AAT GCC ATC AAG ACG GAG GGG AAG GA - #A ACT GAA TTG AAG GAG     7368     Ala Pro Asn Ala Ile Lys Thr Glu Gly Lys Gl - #u Thr Glu Leu Lys Glu     #               23352325 - #                2330     - CTG GCT CAG GGG GAT GTG CAG AGA TGT GTG GA - #A GCA GTG ACC AAT TAT     7416     Leu Ala Gln Gly Asp Val Gln Arg Cys Val Gl - #u Ala Val Thr Asn Tyr     #              23505     - GCG AGA GAG GGC ATC CAA TTC ATG AAG TCG CA - #G GCA CTG AAA GTG AGA     7464     Ala Arg Glu Gly Ile Gln Phe Met Lys Ser Gl - #n Ala Leu Lys Val Arg     #          23650     - GAA ACC CCT ACC TAT AAA GAG ACA ATG AAC AC - #C GTG GCA GAT TAT GTG     7512     Glu Thr Pro Thr Tyr Lys Glu Thr Met Asn Th - #r Val Ala Asp Tyr Val     #      23805     - AAA AAG TTT ATT GAG GCA CTG ACG GAT AGC AA - #G GAA GAC ATC ATT AAA     7560     Lys Lys Phe Ile Glu Ala Leu Thr Asp Ser Ly - #s Glu Asp Ile Ile Lys     #  23950     - TAT GGG CTG TGG GGG GCA CAT ACG GCA TTG TA - #T AAG AGC ATT GGT GCC     7608     Tyr Gly Leu Trp Gly Ala His Thr Ala Leu Ty - #r Lys Ser Ile Gly Ala     #               24152405 - #                2410     - AGG CTT GGT CAC GAA ACC GCG TTC GCA ACT CT - #A GTT GTG AAG TGG TTG     7656     Arg Leu Gly His Glu Thr Ala Phe Ala Thr Le - #u Val Val Lys Trp Leu     #              24305     - GCA TTT GGG GGG GAG TCA ATA TCA GAC CAC AT - #A AAG CAA GCG GCC ACA     7704     Ala Phe Gly Gly Glu Ser Ile Ser Asp His Il - #e Lys Gln Ala Ala Thr     #          24450     - GAC TTG GTC GTT TAT TAC ATT ATT AAC AGA CC - #T CAA TTC CCA GGA GAC     7752     Asp Leu Val Val Tyr Tyr Ile Ile Asn Arg Pr - #o Gln Phe Pro Gly Asp     #      24605     - ACA GAA ACA CAA CAA GAA GGG AGA AAA TTT GT - #T GCC AGC CTG CTA GTC     7800     Thr Glu Thr Gln Gln Glu Gly Arg Lys Phe Va - #l Ala Ser Leu Leu Val     #  24750     - TCA GCT CTA GCG ACT TAT ACA TAC AAG AGC TG - #G AAC TAC AAT AAT CTG     7848     Ser Ala Leu Ala Thr Tyr Thr Tyr Lys Ser Tr - #p Asn Tyr Asn Asn Leu     #               24952485 - #                2490     - TCC AAA ATA GTT GAA CCG GCT TTG GCT ACC CT - #G CCC TAT GCC GCT AAA     7896     Ser Lys Ile Val Glu Pro Ala Leu Ala Thr Le - #u Pro Tyr Ala Ala Lys     #              25105     - GCC CTC AAG CTA TTT GCT CCT ACC CGA CTG GA - #G AGC GTT GTC ATA CTG     7944     Ala Leu Lys Leu Phe Ala Pro Thr Arg Leu Gl - #u Ser Val Val Ile Leu     #          25250     - AGC ACT GCA ATC TAC AAA ACA TAC CTA TCA AT - #A AGG CGA GGC AAA AGT     7992     Ser Thr Ala Ile Tyr Lys Thr Tyr Leu Ser Il - #e Arg Arg Gly Lys Ser     #      25405     - GAT GGT CTG CTA GGT ACA GGG GTT AGC GCG GC - #C ATG GAA ATT ATG TCA     8040     Asp Gly Leu Leu Gly Thr Gly Val Ser Ala Al - #a Met Glu Ile Met Ser     #  25550     - CAA AAC CCA GTA TCT GTG GGT ATA GCA GTT AT - #G CTA GGG GTA GGG GCT     8088     Gln Asn Pro Val Ser Val Gly Ile Ala Val Me - #t Leu Gly Val Gly Ala     #               25752565 - #                2570     - GTA GCA GCC CAC AAT GCA ATT GAA GCC AGT GA - #G CAA AAA AGA ACA CTA     8136     Val Ala Ala His Asn Ala Ile Glu Ala Ser Gl - #u Gln Lys Arg Thr Leu     #              25905     - CTT ATG AAA GTC TTT GTG AAA AAC TTC TTA GA - #C CAG GCC GCC ACC GAC     8184     Leu Met Lys Val Phe Val Lys Asn Phe Leu As - #p Gln Ala Ala Thr Asp     #          26050     - GAA CTA GTC AAA GAG AGC CCT GAG AAA ATA AT - #A ATG GCT TTG TTC GAA     8232     Glu Leu Val Lys Glu Ser Pro Glu Lys Ile Il - #e Met Ala Leu Phe Glu     #      26205     - GCG GTG CAA ACG GTG GGC AAC CCT CTT AGA TT - #A GTG TAC CAC CTC TAT     8280     Ala Val Gln Thr Val Gly Asn Pro Leu Arg Le - #u Val Tyr His Leu Tyr     #  26350     - GGA GTT TTC TAT AAA GGG TGG GAA GCA AAA GA - #G TTG GCC CAA AGA ACA     8328     Gly Val Phe Tyr Lys Gly Trp Glu Ala Lys Gl - #u Leu Ala Gln Arg Thr     #               26552645 - #                2650     - GCC GGC AGG AAC CTT TTC ACC TTG ATA ATG TT - #C GAG GCT GTG GAA CTA     8376     Ala Gly Arg Asn Leu Phe Thr Leu Ile Met Ph - #e Glu Ala Val Glu Leu     #              26705     - CTG GGA GTA GAC AGT GAG GGA AAA ATT CGC CA - #G CTA TCG AGC AAT TAC     8424     Leu Gly Val Asp Ser Glu Gly Lys Ile Arg Gl - #n Leu Ser Ser Asn Tyr     #          26850     - ATA CTA GAG CTC TTG TAT AAG TTC CGC GAC AA - #T ATC AAG TCT AGT GTG     8472     Ile Leu Glu Leu Leu Tyr Lys Phe Arg Asp As - #n Ile Lys Ser Ser Val     #      27005     - AGG GAG ATA GCA ATC AGC TGG GCC CCC GCC CC - #C TTT AGT TGC GAT TGG     8520     Arg Glu Ile Ala Ile Ser Trp Ala Pro Ala Pr - #o Phe Ser Cys Asp Trp     #  27150     - ACA CCA ACA GAT GAC AGA ATA GGG CTT CCC CA - #T GAC AAT TAC CTC CGA     8568     Thr Pro Thr Asp Asp Arg Ile Gly Leu Pro Hi - #s Asp Asn Tyr Leu Arg     #               27352725 - #                2730     - GTG GAG ACA AAG TGC CCC TGT GGT TAC AGG AT - #G AAA GCG GTA AAA AAC     8616     Val Glu Thr Lys Cys Pro Cys Gly Tyr Arg Me - #t Lys Ala Val Lys Asn     #              27505     - TGC GCT GGG GAG TTG AGA CTT CTG GAG GAA GG - #G GGT TCA TTC CTC TGC     8664     Cys Ala Gly Glu Leu Arg Leu Leu Glu Glu Gl - #y Gly Ser Phe Leu Cys     #          27650     - AGA AAT AAA TTC GGT AGA GGC TCA CAA AAC TA - #C AGG GTG ACA AAA TAC     8712     Arg Asn Lys Phe Gly Arg Gly Ser Gln Asn Ty - #r Arg Val Thr Lys Tyr     #      27805     - TAT GAT GAC AAT TTA TCA GAA ATA AAA CCA GT - #G ATA AGA ATG GAA GGA     8760     Tyr Asp Asp Asn Leu Ser Glu Ile Lys Pro Va - #l Ile Arg Met Glu Gly     #  27950     - CAC GTG GAA CTG TAT TAC AAG GGG GCC ACT AT - #C AAA CTG GAT TTT AAC     8808     His Val Glu Leu Tyr Tyr Lys Gly Ala Thr Il - #e Lys Leu Asp Phe Asn     #               28152805 - #                2810     - AAC AGT AAA ACG GTA CTG GCA ACT GAC AAA TG - #G GAG GTT GAC CAC TCC     8856     Asn Ser Lys Thr Val Leu Ala Thr Asp Lys Tr - #p Glu Val Asp His Ser     #              28305     - ACC CTG GTT AGG GCA CTC AAG AGG TAC ACA GG - #G GCT GGA TAT CGA GGG     8904     Thr Leu Val Arg Ala Leu Lys Arg Tyr Thr Gl - #y Ala Gly Tyr Arg Gly     #          28450     - GCG TAT TTG GGT GAG AAA CCT AAC CAT AAA CA - #T CTG ATA CAG AGA GAC     8952     Ala Tyr Leu Gly Glu Lys Pro Asn His Lys Hi - #s Leu Ile Gln Arg Asp     #      28605     - TGT GCA ACG ATT ACC AAA GAC AAG GTC TGC TT - #C ATC AAA ATG AAG AGA     9000     Cys Ala Thr Ile Thr Lys Asp Lys Val Cys Ph - #e Ile Lys Met Lys Arg     #  28750     - GGG TGT GCG TTC ACT TAT GAC CTA TCC CTC CA - #C AAC CTT ACC CGG CTA     9048     Gly Cys Ala Phe Thr Tyr Asp Leu Ser Leu Hi - #s Asn Leu Thr Arg Leu     #               28952885 - #                2890     - ATC GAA TTG GTA CAC AAG AAT AAC CTG GAA GA - #T AGA GAA ATC CCT GCT     9096     Ile Glu Leu Val His Lys Asn Asn Leu Glu As - #p Arg Glu Ile Pro Ala     #              29105     - GTG ACG GTT ACA ACC TGG CTG GCC TAC ACA TT - #T GTG AAT GAA GAC ATA     9144     Val Thr Val Thr Thr Trp Leu Ala Tyr Thr Ph - #e Val Asn Glu Asp Ile     #          29250     - GGG ACC ATA AAA CCA ACT TTT GGG GAA AAG GT - #G ACA CCG GAG AAA CAG     9192     Gly Thr Ile Lys Pro Thr Phe Gly Glu Lys Va - #l Thr Pro Glu Lys Gln     #      29405     - GAG GAG GTA GTC TTG CAG CCT GCT GTG GTG GT - #G GAC ACA ACA GAT GTA     9240     Glu Glu Val Val Leu Gln Pro Ala Val Val Va - #l Asp Thr Thr Asp Val     #  29550     - GCC GTG ACC GTG GTA GGG GAA ACC TCT ACT AT - #G ACT ACA GGG GAG ACC     9288     Ala Val Thr Val Val Gly Glu Thr Ser Thr Me - #t Thr Thr Gly Glu Thr     #               29752965 - #                2970     - CCG ACA ACA TTT ACC AGC TTA GGT TCG GAC TC - #G AAG GTC CGA CAA GTC     9336     Pro Thr Thr Phe Thr Ser Leu Gly Ser Asp Se - #r Lys Val Arg Gln Val     #              29905     - CTG AAG CTG GGC GTG GAC GAT GGT CAA TAC CC - #C GGG CCT AAT CAG CAG     9384     Leu Lys Leu Gly Val Asp Asp Gly Gln Tyr Pr - #o Gly Pro Asn Gln Gln     #          30050     - AGA GCA AGC CTG CTC GAA GCT ATA CAA GGT GT - #G GAT GAA AGG CCC TCG     9432     Arg Ala Ser Leu Leu Glu Ala Ile Gln Gly Va - #l Asp Glu Arg Pro Ser     #      30205     - GTA CTG ATA CTG GGG TCT GAT AAG GCC ACC TC - #C AAT AGG GTC AAG ACC     9480     Val Leu Ile Leu Gly Ser Asp Lys Ala Thr Se - #r Asn Arg Val Lys Thr     #  30350     - GCA AAG AAT GTG AAG ATA TAT AGG AGC AGG GA - #C CCC CTG GAA CTG AGA     9528     Ala Lys Asn Val Lys Ile Tyr Arg Ser Arg As - #p Pro Leu Glu Leu Arg     #               30553045 - #                3050     - GAA ATG ATG AAA AGG GGA AAA ATC CTA GTC GT - #A GCC TTG TCT AGA GTC     9576     Glu Met Met Lys Arg Gly Lys Ile Leu Val Va - #l Ala Leu Ser Arg Val     #              30705     - GAT ACC GCT CTG CTG AAA TTC GTT GAT TAC AA - #A GGC ACC TTC CTG ACC     9624     Asp Thr Ala Leu Leu Lys Phe Val Asp Tyr Ly - #s Gly Thr Phe Leu Thr     #          30850     - AGA GAG ACC CTA GAG GCA TTA AGT CTG GGT AA - #G CCT AAG AAA AGA GAC     9672     Arg Glu Thr Leu Glu Ala Leu Ser Leu Gly Ly - #s Pro Lys Lys Arg Asp     #      31005     - ATA ACT AAA GCA GAA GCA CAA TGG CTG CTG CG - #C CTC GAA GAC CAA ATA     9720     Ile Thr Lys Ala Glu Ala Gln Trp Leu Leu Ar - #g Leu Glu Asp Gln Ile     #  31150     - GAA GAG CTG CCT GAC TGG TTC GCA GCC AAG GA - #A CCC ATA TTT CTA GAA     9768     Glu Glu Leu Pro Asp Trp Phe Ala Ala Lys Gl - #u Pro Ile Phe Leu Glu     #               31353125 - #                3130     - GCC AAC ATT AAA CGT GAC AAG TAT CAC CTG GT - #A GGG GAC ATA GCC ACT     9816     Ala Asn Ile Lys Arg Asp Lys Tyr His Leu Va - #l Gly Asp Ile Ala Thr     #              31505     - ATT AAA GAA AAA GCC AAA CAA CTG GGG GCA AC - #A GAC TCC ACA AAG ATA     9864     Ile Lys Glu Lys Ala Lys Gln Leu Gly Ala Th - #r Asp Ser Thr Lys Ile     #          31650     - TCA AAG GAG GTT GGC GCG AAA GTG TAT TCT AT - #G AAG CTG AGT AAC TGG     9912     Ser Lys Glu Val Gly Ala Lys Val Tyr Ser Me - #t Lys Leu Ser Asn Trp     #      31805     - GTG ATA CAA GAA GAG AAT AAA CAA GGC AGC CT - #T GCC CCC CTG TTT GAA     9960     Val Ile Gln Glu Glu Asn Lys Gln Gly Ser Le - #u Ala Pro Leu Phe Glu     #  31950     - GAG CTC CTG CAA CAG TGC CCA CCC GGG GGC CA - #G AAC AAA ACC ACA CAT     10008     Glu Leu Leu Gln Gln Cys Pro Pro Gly Gly Gl - #n Asn Lys Thr Thr His     #               32153205 - #                3210     - ATG GTC TCA GCC TAC CAA CTA GCT CAA GGG AA - #T TGG GTG CCA GTT AGT     10056     Met Val Ser Ala Tyr Gln Leu Ala Gln Gly As - #n Trp Val Pro Val Ser     #              32305     - TGC CAC GTG TTC ATG GGG ACC ATA CCC GCC AG - #A AGA ACC AAG ACT CAT     10104     Cys His Val Phe Met Gly Thr Ile Pro Ala Ar - #g Arg Thr Lys Thr His     #          32450     - CCT TAT GAG GCA TAC GTT AAG CTA AGG GAG TT - #G GTA GAT GAA CAT AAG     10152     Pro Tyr Glu Ala Tyr Val Lys Leu Arg Glu Le - #u Val Asp Glu His Lys     #      32605     - ATG AAG GCA TTA TGT GGC GGA TCA GGC CTA AG - #T AAG CAC AAC GAA TGG     10200     Met Lys Ala Leu Cys Gly Gly Ser Gly Leu Se - #r Lys His Asn Glu Trp     #  32750     - GTA ATT GGC AAG GTC AAG TAT CAA GGA AAC CT - #G AGG ACC AAA CAC ATG     10248     Val Ile Gly Lys Val Lys Tyr Gln Gly Asn Le - #u Arg Thr Lys His Met     #               32953285 - #                3290     - TTG AAC CCC GGA AAG GTG GCG GAG CAA CTG CA - #C AGA GAA GGG TAC AGG     10296     Leu Asn Pro Gly Lys Val Ala Glu Gln Leu Hi - #s Arg Glu Gly Tyr Arg     #              33105     - CAC AAT GTG TAT AAT AAG ACA ATA GGT TCA GT - #G ATG ACA GCA ACT GGT     10344     His Asn Val Tyr Asn Lys Thr Ile Gly Ser Va - #l Met Thr Ala Thr Gly     #          33250     - ATC AGG CTG GAG AAG TTA CCT GTG GTT AGG GC - #C CAA ACA GAC ACA ACC     10392     Ile Arg Leu Glu Lys Leu Pro Val Val Arg Al - #a Gln Thr Asp Thr Thr     #      33405     - AAC TTC CAC CAA GCA ATA AGG GAT AAA ATA GA - #C AAG GAG GAG AAC CTA     10440     Asn Phe His Gln Ala Ile Arg Asp Lys Ile As - #p Lys Glu Glu Asn Leu     #  33550     - CAG ACC CCT GGC TTG CAT AAG AAG TTA ATG GA - #A GTC TTC AAT GCA TTA     10488     Gln Thr Pro Gly Leu His Lys Lys Leu Met Gl - #u Val Phe Asn Ala Leu     #               33753365 - #                3370     - AAA AGA CCC GAG CTT GAG GCC TCT TAT GAC GC - #T GTG GAT TGG GAG GAA     10536     Lys Arg Pro Glu Leu Glu Ala Ser Tyr Asp Al - #a Val Asp Trp Glu Glu     #              33905     - TTG GAG AGA GGA ATA AAT AGG AAG GGT GCT GC - #T GGT TTC TTC GAA CGC     10584     Leu Glu Arg Gly Ile Asn Arg Lys Gly Ala Al - #a Gly Phe Phe Glu Arg     #          34050     - AAG AAC ATA GGA GAG GTT TTG GAT TCG GAA AA - #A AAT AAA GTC GAA GAG     10632     Lys Asn Ile Gly Glu Val Leu Asp Ser Glu Ly - #s Asn Lys Val Glu Glu     #      34205     - GTT ATT GAC AGT TTG AAA AAA GGT AGG AAT AT - #C AGA TAC TAC GAA ACT     10680     Val Ile Asp Ser Leu Lys Lys Gly Arg Asn Il - #e Arg Tyr Tyr Glu Thr     #  34350     - GCA ATC CCG AAA AAC GAG AAG AGG GAT GTC AA - #T GAT GAC TGG ACC GCT     10728     Ala Ile Pro Lys Asn Glu Lys Arg Asp Val As - #n Asp Asp Trp Thr Ala     #               34553445 - #                3450     - GGT GAC TTC GTA GAT GAG AAG AAG CCA AGA GT - #G ATA CAA TAC CCT GAG     10776     Gly Asp Phe Val Asp Glu Lys Lys Pro Arg Va - #l Ile Gln Tyr Pro Glu     #              34705     - GCT AAA ACT AGG TTG GCT ATT ACT AAG GTA AT - #G TAC AAG TGG GTC AAA     10824     Ala Lys Thr Arg Leu Ala Ile Thr Lys Val Me - #t Tyr Lys Trp Val Lys     #          34850     - CAG AAG CCA GTT GTC ATA CCG GGT TAT GAA GG - #T AAG ACA CCC CTG TTT     10872     Gln Lys Pro Val Val Ile Pro Gly Tyr Glu Gl - #y Lys Thr Pro Leu Phe     #      35005     - CAA ATT TTT GAC AAA GTG AAG AAA GAA TGG GA - #T CAA TTC CAA AAC CCT     10920     Gln Ile Phe Asp Lys Val Lys Lys Glu Trp As - #p Gln Phe Gln Asn Pro     #  35150     - GTG GCA GTT AGC TTT GAT ACC AAA GCG TGG GA - #T ACC CAG GTA ACC ACA     10968     Val Ala Val Ser Phe Asp Thr Lys Ala Trp As - #p Thr Gln Val Thr Thr     #               35353525 - #                3530     - AGG GAT TTG GAG CTA ATA AGG GAT ATA CAG AA - #G TTC TAT TTT AAA AAG     11016     Arg Asp Leu Glu Leu Ile Arg Asp Ile Gln Ly - #s Phe Tyr Phe Lys Lys     #              35505     - AAA TGG CAC AAA TTC ATT GAC ACC CTA ACC AA - #G CAC ATG TCA GAA GTA     11064     Lys Trp His Lys Phe Ile Asp Thr Leu Thr Ly - #s His Met Ser Glu Val     #          35650     - CCC GTA ATC AGT GCC GAC GGG GAG GTA TAC AT - #A AGG AAA GGT CAG AGA     11112     Pro Val Ile Ser Ala Asp Gly Glu Val Tyr Il - #e Arg Lys Gly Gln Arg     #      35805     - GGC AGT GGG CAA CCT GAC ACG AGC GCA GGC AA - #C AGC ATG TTG AAT GTG     11160     Gly Ser Gly Gln Pro Asp Thr Ser Ala Gly As - #n Ser Met Leu Asn Val     #  35950     - TTG ACA ATG GTG TAT GCC TTC TGC GAG GCC AC - #G GGG GTA CCC TAC AAG     11208     Leu Thr Met Val Tyr Ala Phe Cys Glu Ala Th - #r Gly Val Pro Tyr Lys     #               36153605 - #                3610     - AGT TTT GAC AGA GTG GCA AAG ATC CAT GTC TG - #C GGG GAT GAT GGT TTC     11256     Ser Phe Asp Arg Val Ala Lys Ile His Val Cy - #s Gly Asp Asp Gly Phe     #              36305     - CTG ATT ACC GAA AGA GCT CTC GGT GAG AAA TT - #T GCG AGT AAA GGA GTC     11304     Leu Ile Thr Glu Arg Ala Leu Gly Glu Lys Ph - #e Ala Ser Lys Gly Val     #          36450     - CAG ATC CTA TAC GAA GCT GGG AAG CCT CAA AA - #G ATC ACT GAA GGG GAC     11352     Gln Ile Leu Tyr Glu Ala Gly Lys Pro Gln Ly - #s Ile Thr Glu Gly Asp     #      36605     - AAG ATG AAA GTA GCC TAT CAG TTT GAT GAT AT - #C GAG TTC TGC TCC CAT     11400     Lys Met Lys Val Ala Tyr Gln Phe Asp Asp Il - #e Glu Phe Cys Ser His     #  36750     - ACA CCA GTA CAA GTG AGG TGG TCA GAC AAT AC - #T TCC AGC TAC ATG CCG     11448     Thr Pro Val Gln Val Arg Trp Ser Asp Asn Th - #r Ser Ser Tyr Met Pro     #               36953685 - #                3690     - GGA AGG AAC ACG ACT ACA ATC CTG GCT AAA AT - #G GCT ACA AGG TTG GAT     11496     Gly Arg Asn Thr Thr Thr Ile Leu Ala Lys Me - #t Ala Thr Arg Leu Asp     #              37105     - TCC AGT GGT GAG AGG GGT ACT ATA GCA TAT GA - #G AAG GCA GTG GCG TTC     11544     Ser Ser Gly Glu Arg Gly Thr Ile Ala Tyr Gl - #u Lys Ala Val Ala Phe     #          37250     - AGC TTT TTG TTG ATG TAC TCC TGG AAC CCA CT - #G ATC AGA AGG ATA TGC     11592     Ser Phe Leu Leu Met Tyr Ser Trp Asn Pro Le - #u Ile Arg Arg Ile Cys     #      37405     - TTA CTG GTG TTG TCA ACT GAG TTG CAA GTG AG - #A CCA GGG AAG TCA ACC     11640     Leu Leu Val Leu Ser Thr Glu Leu Gln Val Ar - #g Pro Gly Lys Ser Thr     #  37550     - ACC TAT TAC TAT GAA GGG GAC CCA ATA TCC GC - #T TAC AAG GAA GTC ATT     11688     Thr Tyr Tyr Tyr Glu Gly Asp Pro Ile Ser Al - #a Tyr Lys Glu Val Ile     #               37753765 - #                3770     - GGC CAC AAT CTC TTT GAC CTT AAA AGA ACA AG - #C TTC GAA AAG CTA GCA     11736     Gly His Asn Leu Phe Asp Leu Lys Arg Thr Se - #r Phe Glu Lys Leu Ala     #              37905     - AAG TTA AAT CTC AGC ATG TCC ACG CTC GGG GT - #G TGG ACT AGA CAC ACT     11784     Lys Leu Asn Leu Ser Met Ser Thr Leu Gly Va - #l Trp Thr Arg His Thr     #          38050     - AGC AAG AGA TTA CTA CAA GAT TGT GTC AAT GT - #T GGC ACC AAA GAG GGC     11832     Ser Lys Arg Leu Leu Gln Asp Cys Val Asn Va - #l Gly Thr Lys Glu Gly     #      38205     - AAC TGG CTG GTC AAT GCA GAC AGA CTA GTG AG - #T AGT AAG ACA GGA AAC     11880     Asn Trp Leu Val Asn Ala Asp Arg Leu Val Se - #r Ser Lys Thr Gly Asn     #  38350     - AGG TAT ATA CCT GGA GAG GGC CAC ACC CTA CA - #A GGG AAA CAT TAT GAA     11928     Arg Tyr Ile Pro Gly Glu Gly His Thr Leu Gl - #n Gly Lys His Tyr Glu     #               38553845 - #                3850     - GAA CTG ATA CTG GCA AGG AAA CCG ATC GGT AA - #C TTT GAA GGG ACC GAT     11976     Glu Leu Ile Leu Ala Arg Lys Pro Ile Gly As - #n Phe Glu Gly Thr Asp     #              38705     - AGG TAT AAC TTG GGG CCA ATA GTC AAT GTA GT - #G TTG AGG AGA CTA AAA     12024     Arg Tyr Asn Leu Gly Pro Ile Val Asn Val Va - #l Leu Arg Arg Leu Lys     #          38850     - ATT ATG ATG ATG GCC CTG ATA GGA AGG GGG GT - #G TGAGCATGGT TGGCCCTTGA     12077     Ile Met Met Met Ala Leu Ile Gly Arg Gly Va - #l     #       3895     - TCGGGCCCTA TCAGTAGAAC CCTATTGTAA ATAACATTAA CTTATTAATT AT - #TTAGATAC     12137     - TATTATTTAT TTATTTATTT ATTTATTGAA TGAGCAAGTA CTGGTACAAA CT - #ACCTCATG     12197     - TTACCACACT ACACTCATTT TAACAGCACT TTAGCTGGAG GGAAAACCCT GA - #CGTCCACA     12257     #          12284   TTCC TAACGGC     - (2) INFORMATION FOR SEQ ID NO:2:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 3898 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:     - Met Glu Leu Asn His Phe Glu Leu Leu Tyr Ly - #s Thr Ser Lys Gln Lys     #                 15     - Pro Val Gly Val Glu Glu Pro Val Tyr Asp Th - #r Ala Gly Arg Pro Leu     #             30     - Phe Gly Asn Pro Ser Glu Val His Pro Gln Se - #r Thr Leu Lys Leu Pro     #         45     - His Asp Arg Gly Arg Gly Asp Ile Arg Thr Th - #r Leu Arg Asp Leu Pro     #     60     - Arg Lys Gly Asp Cys Arg Ser Gly Asn His Le - #u Gly Pro Val Ser Gly     # 80     - Ile Tyr Ile Lys Pro Gly Pro Val Tyr Tyr Gl - #n Asp Tyr Thr Gly Pro     #                 95     - Val Tyr His Arg Ala Pro Leu Glu Phe Phe As - #p Glu Ala Gln Phe Cys     #           110     - Glu Val Thr Lys Arg Ile Gly Arg Val Thr Gl - #y Ser Asp Gly Lys Leu     #       125     - Tyr His Ile Tyr Val Cys Val Asp Gly Cys Il - #e Leu Leu Lys Leu Ala     #   140     - Lys Arg Gly Thr Pro Arg Thr Leu Lys Trp Il - #e Arg Asn Phe Thr Asn     145                 1 - #50                 1 - #55                 1 -     #60     - Cys Pro Leu Trp Val Thr Ser Cys Ser Asp As - #p Gly Ala Ser Gly Ser     #               175     - Lys Asp Lys Lys Pro Asp Arg Met Asn Lys Gl - #y Lys Leu Lys Ile Ala     #           190     - Pro Arg Glu His Glu Lys Asp Ser Lys Thr Ly - #s Pro Pro Asp Ala Thr     #       205     - Ile Val Val Glu Gly Val Lys Tyr Gln Ile Ly - #s Lys Lys Gly Lys Val     #   220     - Lys Gly Lys Asn Thr Gln Asp Gly Leu Tyr Hi - #s Asn Lys Asn Lys Pro     225                 2 - #30                 2 - #35                 2 -     #40     - Pro Glu Ser Arg Lys Lys Leu Glu Lys Ala Le - #u Leu Ala Trp Ala Val     #               255     - Ile Thr Ile Leu Leu Tyr Gln Pro Val Ala Al - #a Glu Asn Ile Thr Gln     #           270     - Trp Asn Leu Ser Asp Asn Gly Thr Asn Gly Il - #e Gln Arg Ala Met Tyr     #       285     - Leu Arg Gly Val Asn Arg Ser Leu His Gly Il - #e Trp Pro Glu Lys Ile     #   300     - Cys Lys Gly Val Pro Thr His Leu Ala Thr As - #p Thr Glu Leu Lys Glu     305                 3 - #10                 3 - #15                 3 -     #20     - Ile Arg Gly Met Met Asp Ala Ser Glu Arg Th - #r Asn Tyr Thr Cys Cys     #               335     - Arg Leu Gln Arg His Glu Trp Asn Lys His Gl - #y Trp Cys Asn Trp Tyr     #           350     - Asn Ile Asp Pro Trp Ile Gln Leu Met Asn Ar - #g Thr Gln Thr Asn Leu     #       365     - Thr Glu Gly Pro Pro Asp Lys Glu Cys Ala Va - #l Thr Cys Arg Tyr Asp     #   380     - Lys Asn Thr Asp Val Asn Val Val Thr Gln Al - #a Arg Asn Arg Pro Thr     385                 3 - #90                 3 - #95                 4 -     #00     - Thr Leu Thr Gly Cys Lys Lys Gly Lys Asn Ph - #e Ser Phe Ala Gly Thr     #               415     - Val Ile Glu Gly Pro Cys Asn Phe Asn Val Se - #r Val Glu Asp Ile Leu     #           430     - Tyr Gly Asp His Glu Cys Gly Ser Leu Leu Gl - #n Asp Thr Ala Leu Tyr     #       445     - Leu Leu Asp Gly Met Thr Asn Thr Ile Glu As - #n Ala Arg Gln Gly Ala     #   460     - Ala Arg Val Thr Ser Trp Leu Gly Arg Gln Le - #u Ser Thr Ala Gly Lys     465                 4 - #70                 4 - #75                 4 -     #80     - Lys Leu Glu Arg Arg Ser Lys Thr Trp Phe Gl - #y Ala Tyr Ala Leu Ser     #               495     - Pro Tyr Cys Asn Val Thr Arg Lys Ile Gly Ty - #r Ile Trp Tyr Thr Asn     #           510     - Asn Cys Thr Pro Ala Cys Leu Pro Lys Asn Th - #r Lys Ile Ile Gly Pro     #       525     - Gly Lys Phe Asp Thr Asn Ala Glu Asp Gly Ly - #s Ile Leu His Glu Met     #   540     - Gly Gly His Leu Ser Glu Phe Leu Leu Leu Se - #r Leu Val Ile Leu Ser     545                 5 - #50                 5 - #55                 5 -     #60     - Asp Phe Ala Pro Glu Thr Ala Ser Thr Leu Ty - #r Leu Ile Leu His Tyr     #               575     - Ala Ile Pro Gln Ser His Glu Glu Pro Glu Gl - #y Cys Asp Thr Asn Gln     #           590     - Leu Asn Leu Thr Val Lys Leu Arg Thr Glu As - #p Val Val Pro Ser Ser     #       605     - Val Trp Asn Ile Gly Lys Tyr Val Cys Val Ar - #g Pro Asp Trp Trp Pro     #   620     - Tyr Glu Thr Lys Val Ala Leu Leu Phe Glu Gl - #u Ala Gly Gln Val Ile     625                 6 - #30                 6 - #35                 6 -     #40     - Lys Leu Val Leu Arg Ala Leu Arg Asp Leu Th - #r Arg Val Trp Asn Ser     #               655     - Ala Ser Thr Thr Ala Phe Leu Ile Cys Leu Il - #e Lys Val Leu Arg Gly     #           670     - Gln Val Val Gln Gly Ile Ile Trp Leu Leu Le - #u Val Thr Gly Ala Gln     #       685     - Gly Arg Leu Ala Cys Lys Glu Asp Tyr Arg Ty - #r Ala Ile Ser Ser Thr     #   700     - Asn Glu Ile Gly Leu Leu Gly Ala Glu Gly Le - #u Thr Thr Thr Trp Lys     705                 7 - #10                 7 - #15                 7 -     #20     - Glu Tyr Ser His Gly Leu Gln Leu Asp Asp Gl - #y Thr Val Lys Ala Val     #               735     - Cys Thr Ala Gly Ser Phe Lys Val Thr Ala Le - #u Asn Val Val Ser Arg     #           750     - Arg Tyr Leu Ala Ser Leu His Lys Arg Ala Le - #u Pro Thr Ser Val Thr     #       765     - Phe Glu Leu Leu Phe Asp Gly Thr Asn Pro Al - #a Ile Glu Glu Met Asp     #   780     - Asp Asp Phe Gly Phe Gly Leu Cys Pro Phe As - #p Thr Ser Pro Val Ile     785                 7 - #90                 7 - #95                 8 -     #00     - Lys Gly Lys Tyr Asn Thr Thr Leu Leu Asn Gl - #y Ser Ala Phe Tyr Leu     #               815     - Val Cys Pro Ile Gly Trp Thr Gly Val Val Gl - #u Cys Thr Ala Val Ser     #           830     - Pro Thr Thr Leu Arg Thr Glu Val Val Lys Th - #r Phe Arg Arg Asp Lys     #       845     - Pro Phe Pro His Arg Val Asp Cys Val Thr Th - #r Ile Val Glu Lys Glu     #   860     - Asp Leu Phe His Cys Lys Leu Gly Gly Asn Tr - #p Thr Cys Val Lys Gly     865                 8 - #70                 8 - #75                 8 -     #80     - Asp Pro Val Thr Tyr Lys Gly Gly Gln Val Ly - #s Gln Cys Arg Trp Cys     #               895     - Gly Phe Glu Phe Lys Glu Pro Tyr Gly Leu Pr - #o His Tyr Pro Ile Gly     #           910     - Lys Cys Ile Leu Thr Asn Glu Thr Gly Tyr Ar - #g Val Val Asp Ser Thr     #       925     - Asp Cys Asn Arg Asp Gly Val Val Ile Ser Th - #r Glu Gly Glu His Glu     #   940     - Cys Leu Ile Gly Asn Thr Thr Val Lys Val Hi - #s Ala Leu Asp Glu Arg     945                 9 - #50                 9 - #55                 9 -     #60     - Leu Gly Pro Met Pro Cys Arg Pro Lys Glu Il - #e Val Ser Ser Glu Gly     #               975     - Pro Val Arg Lys Thr Ser Cys Thr Phe Asn Ty - #r Thr Lys Thr Leu Arg     #           990     - Asn Lys Tyr Tyr Glu Pro Arg Asp Ser Tyr Ph - #e Gln Gln Tyr Met Leu     #      10050     - Lys Gly Glu Tyr Gln Tyr Trp Phe Asn Leu As - #p Val Thr Asp His His     #  10205     - Thr Asp Tyr Phe Ala Glu Phe Val Val Leu Va - #l Val Val Ala Leu Leu     #               10401030 - #                1035     - Gly Gly Arg Tyr Val Leu Trp Leu Ile Val Th - #r Tyr Ile Ile Leu Thr     #              10550     - Glu Gln Leu Ala Ala Gly Leu Gln Leu Gly Gl - #n Gly Glu Val Val Leu     #          10705     - Ile Gly Asn Leu Ile Thr His Thr Asp Asn Gl - #u Val Val Val Tyr Phe     #      10850     - Leu Leu Leu Tyr Leu Val Ile Arg Asp Glu Pr - #o Ile Lys Lys Trp Ile     #  11005     - Leu Leu Leu Phe His Ala Met Thr Asn Asn Pr - #o Val Lys Thr Ile Thr     #               11201110 - #                1115     - Val Ala Leu Leu Met Ile Ser Gly Val Ala Ly - #s Gly Gly Lys Ile Asp     #              11350     - Gly Gly Trp Gln Arg Gln Pro Val Thr Ser Ph - #e Asp Ile Gln Leu Ala     #          11505     - Leu Ala Val Val Val Val Val Val Met Leu Le - #u Ala Lys Arg Asp Pro     #      11650     - Thr Thr Phe Pro Leu Val Ile Thr Val Ala Th - #r Leu Arg Thr Ala Lys     #  11805     - Ile Thr Asn Gly Phe Ser Thr Asp Leu Val Il - #e Ala Thr Val Ser Ala     #               12001190 - #                1195     - Ala Leu Leu Thr Trp Thr Tyr Ile Ser Asp Ty - #r Tyr Lys Tyr Lys Thr     #              12150     - Trp Leu Gln Tyr Leu Val Ser Thr Val Thr Gl - #y Ile Phe Leu Ile Arg     #          12305     - Val Leu Lys Gly Ile Gly Glu Leu Asp Leu Hi - #s Ala Pro Thr Leu Pro     #      12450     - Ser His Arg Pro Leu Phe Tyr Ile Leu Val Ty - #r Leu Ile Ser Thr Ala     #  12605     - Val Val Thr Arg Trp Asn Leu Asp Val Ala Gl - #y Leu Leu Leu Gln Cys     #               12801270 - #                1275     - Val Pro Thr Leu Leu Met Val Phe Thr Met Tr - #p Ala Asp Ile Leu Thr     #              12950     - Leu Ile Leu Ile Leu Pro Thr Tyr Glu Leu Th - #r Lys Leu Tyr Tyr Leu     #          13105     - Lys Glu Val Lys Ile Gly Ala Glu Arg Gly Tr - #p Leu Trp Lys Thr Asn     #      13250     - Tyr Lys Arg Val Asn Asp Ile Tyr Glu Val As - #p Gln Thr Ser Glu Gly     #  13405     - Val Tyr Leu Phe Pro Ser Lys Gln Arg Thr Se - #r Ala Ile Thr Ser Thr     #               13601350 - #                1355     - Met Leu Pro Leu Ile Lys Ala Ile Leu Ile Se - #r Cys Ile Ser Asn Lys     #              13750     - Trp Gln Leu Ile Tyr Leu Leu Tyr Leu Ile Ph - #e Glu Val Ser Tyr Tyr     #          13905     - Leu His Lys Lys Val Ile Asp Glu Ile Ala Gl - #y Gly Thr Asn Phe Val     #      14050     - Ser Arg Leu Val Ala Ala Leu Ile Glu Val As - #n Trp Ala Phe Asp Asn     #  14205     - Glu Glu Val Lys Gly Leu Lys Lys Phe Phe Le - #u Leu Ser Ser Arg Val     #               14401430 - #                1435     - Lys Glu Leu Ile Ile Lys His Lys Val Arg As - #n Glu Val Val Val Arg     #              14550     - Trp Phe Gly Asp Glu Glu Ile Tyr Gly Met Pr - #o Lys Leu Ile Gly Leu     #          14705     - Val Lys Ala Ala Thr Leu Ser Arg Asn Lys Hi - #s Cys Met Leu Cys Thr     #      14850     - Val Cys Glu Asp Arg Asp Trp Arg Gly Glu Th - #r Cys Pro Lys Cys Gly     #  15005     - Arg Phe Gly Pro Pro Val Val Cys Gly Met Th - #r Leu Ala Asp Phe Glu     #               15201510 - #                1515     - Glu Lys His Tyr Lys Arg Ile Phe Ile Arg Gl - #u Asp Gln Ser Gly Gly     #              15350     - Pro Leu Arg Glu Glu His Ala Gly Tyr Leu Gl - #n Tyr Lys Ala Arg Gly     #          15505     - Gln Leu Phe Leu Arg Asn Leu Pro Val Leu Al - #a Thr Lys Val Lys Met     #      15650     - Leu Leu Val Gly Asn Leu Gly Thr Glu Ile Gl - #y Asp Leu Glu His Leu     #  15805     - Gly Trp Val Leu Arg Gly Pro Ala Val Cys Ly - #s Lys Val Thr Glu His     #               16001590 - #                1595     - Glu Arg Cys Thr Thr Ser Ile Met Asp Lys Le - #u Thr Ala Phe Phe Gly     #              16150     - Val Met Pro Arg Gly Thr Thr Pro Arg Ala Pr - #o Val Arg Phe Pro Thr     #          16305     - Ser Leu Leu Lys Ile Arg Arg Gly Leu Glu Th - #r Gly Trp Ala Tyr Thr     #      16450     - His Gln Gly Gly Ile Ser Ser Val Asp His Va - #l Thr Cys Gly Lys Asp     #  16605     - Leu Leu Val Cys Asp Thr Met Gly Arg Thr Ar - #g Val Val Cys Gln Ser     #               16801670 - #                1675     - Asn Asn Lys Met Thr Asp Glu Ser Glu Tyr Gl - #y Val Lys Thr Asp Ser     #              16950     - Gly Cys Pro Glu Gly Ala Arg Cys Tyr Val Ph - #e Asn Pro Glu Ala Val     #          17105     - Asn Ile Ser Gly Thr Lys Gly Ala Met Val Hi - #s Leu Gln Lys Thr Gly     #      17250     - Gly Glu Phe Thr Cys Val Thr Ala Ser Gly Th - #r Pro Ala Phe Phe Asp     #  17405     - Leu Lys Asn Leu Lys Gly Trp Ser Gly Leu Pr - #o Ile Phe Glu Ala Ser     #               17601750 - #                1755     - Ser Gly Arg Val Val Gly Arg Val Lys Val Gl - #y Lys Asn Glu Asp Ser     #              17750     - Lys Pro Thr Lys Leu Met Ser Gly Ile Gln Th - #r Val Ser Lys Ser Ala     #          17905     - Thr Asp Leu Thr Glu Met Val Lys Lys Ile Th - #r Thr Met Asn Arg Gly     #      18050     - Glu Phe Arg Gln Ile Thr Leu Ala Thr Gly Al - #a Gly Lys Thr Thr Glu     #  18205     - Leu Pro Arg Ser Val Ile Glu Glu Ile Gly Ar - #g His Lys Arg Val Leu     #               18401830 - #                1835     - Val Leu Ile Pro Leu Arg Ala Ala Ala Glu Se - #r Val Tyr Gln Tyr Met     #              18550     - Arg Gln Lys His Pro Ser Ile Ala Phe Asn Le - #u Arg Ile Gly Glu Met     #          18705     - Lys Glu Gly Asp Met Ala Thr Gly Ile Thr Ty - #r Ala Ser Tyr Gly Tyr     #      18850     - Phe Cys Gln Met Ser Gln Pro Lys Leu Arg Al - #a Ala Met Val Glu Tyr     #  19005     - Ser Phe Ile Phe Leu Asp Glu Tyr His Cys Al - #a Thr Pro Glu Gln Leu     #               19201910 - #                1915     - Ala Ile Met Gly Lys Ile His Arg Phe Ser Gl - #u Asn Leu Arg Val Val     #              19350     - Ala Met Thr Ala Thr Pro Ala Gly Thr Val Th - #r Thr Thr Gly Gln Lys     #          19505     - His Pro Ile Glu Glu Phe Ile Ala Pro Glu Va - #l Met Lys Gly Glu Asp     #      19650     - Leu Gly Ser Glu Tyr Leu Asp Ile Ala Gly Le - #u Lys Ile Pro Val Glu     #  19805     - Glu Met Lys Asn Asn Met Leu Val Phe Val Pr - #o Thr Arg Asn Met Ala     #               20001990 - #                1995     - Val Glu Ala Ala Lys Lys Leu Lys Ala Lys Gl - #y Tyr Asn Ser Gly Tyr     #              20150     - Tyr Tyr Ser Gly Glu Asp Pro Ser Asn Leu Ar - #g Val Val Thr Ser Gln     #          20305     - Ser Pro Tyr Val Val Val Ala Thr Asn Ala Il - #e Glu Ser Gly Val Thr     #      20450     - Leu Pro Asp Leu Asp Val Val Val Asp Thr Gl - #y Leu Lys Cys Glu Lys     #  20605     - Arg Ile Arg Leu Ser Pro Lys Met Pro Phe Il - #e Val Thr Gly Leu Lys     #               20802070 - #                2075     - Arg Met Ala Val Thr Ile Gly Glu Gln Ala Gl - #n Arg Arg Gly Arg Val     #              20950     - Gly Arg Val Lys Pro Gly Arg Tyr Tyr Arg Se - #r Gln Glu Thr Pro Val     #          21105     - Gly Ser Lys Asp Tyr His Tyr Asp Leu Leu Gl - #n Ala Gln Arg Tyr Gly     #      21250     - Ile Glu Asp Gly Ile Asn Ile Thr Lys Ser Ph - #e Arg Glu Met Asn Tyr     #  21405     - Asp Trp Ser Leu Tyr Glu Glu Asp Ser Leu Me - #t Ile Thr Gln Leu Glu     #               21602150 - #                2155     - Ile Leu Asn Asn Leu Leu Ile Ser Glu Glu Le - #u Pro Met Ala Val Lys     #              21750     - Asn Ile Met Ala Arg Thr Asp His Pro Glu Pr - #o Ile Gln Leu Ala Tyr     #          21905     - Asn Ser Tyr Glu Thr Gln Val Pro Val Leu Ph - #e Pro Lys Ile Arg Asn     #      22050     - Gly Glu Val Thr Asp Thr Tyr Asp Asn Tyr Th - #r Phe Leu Asn Ala Arg     #  22205     - Lys Leu Gly Asp Asp Val Pro Pro Tyr Val Ty - #r Ala Thr Glu Asp Glu     #               22402230 - #                2235     - Asp Leu Ala Val Glu Leu Leu Gly Leu Asp Tr - #p Pro Asp Pro Gly Asn     #              22550     - Gln Gly Thr Val Glu Ala Gly Arg Ala Leu Ly - #s Gln Val Val Gly Leu     #          22705     - Ser Thr Ala Glu Asn Ala Leu Leu Val Ala Le - #u Phe Gly Tyr Val Gly     #      22850     - Tyr Gln Ala Leu Ser Lys Arg His Ile Pro Va - #l Val Thr Asp Ile Tyr     #  23005     - Ser Val Glu Asp His Arg Leu Glu Asp Thr Th - #r His Leu Gln Tyr Ala     #               23202310 - #                2315     - Pro Asn Ala Ile Lys Thr Glu Gly Lys Glu Th - #r Glu Leu Lys Glu Leu     #              23350     - Ala Gln Gly Asp Val Gln Arg Cys Val Glu Al - #a Val Thr Asn Tyr Ala     #          23505     - Arg Glu Gly Ile Gln Phe Met Lys Ser Gln Al - #a Leu Lys Val Arg Glu     #      23650     - Thr Pro Thr Tyr Lys Glu Thr Met Asn Thr Va - #l Ala Asp Tyr Val Lys     #  23805     - Lys Phe Ile Glu Ala Leu Thr Asp Ser Lys Gl - #u Asp Ile Ile Lys Tyr     #               24002390 - #                2395     - Gly Leu Trp Gly Ala His Thr Ala Leu Tyr Ly - #s Ser Ile Gly Ala Arg     #              24150     - Leu Gly His Glu Thr Ala Phe Ala Thr Leu Va - #l Val Lys Trp Leu Ala     #          24305     - Phe Gly Gly Glu Ser Ile Ser Asp His Ile Ly - #s Gln Ala Ala Thr Asp     #      24450     - Leu Val Val Tyr Tyr Ile Ile Asn Arg Pro Gl - #n Phe Pro Gly Asp Thr     #  24605     - Glu Thr Gln Gln Glu Gly Arg Lys Phe Val Al - #a Ser Leu Leu Val Ser     #               24802470 - #                2475     - Ala Leu Ala Thr Tyr Thr Tyr Lys Ser Trp As - #n Tyr Asn Asn Leu Ser     #              24950     - Lys Ile Val Glu Pro Ala Leu Ala Thr Leu Pr - #o Tyr Ala Ala Lys Ala     #          25105     - Leu Lys Leu Phe Ala Pro Thr Arg Leu Glu Se - #r Val Val Ile Leu Ser     #      25250     - Thr Ala Ile Tyr Lys Thr Tyr Leu Ser Ile Ar - #g Arg Gly Lys Ser Asp     #  25405     - Gly Leu Leu Gly Thr Gly Val Ser Ala Ala Me - #t Glu Ile Met Ser Gln     #               25602550 - #                2555     - Asn Pro Val Ser Val Gly Ile Ala Val Met Le - #u Gly Val Gly Ala Val     #              25750     - Ala Ala His Asn Ala Ile Glu Ala Ser Glu Gl - #n Lys Arg Thr Leu Leu     #          25905     - Met Lys Val Phe Val Lys Asn Phe Leu Asp Gl - #n Ala Ala Thr Asp Glu     #      26050     - Leu Val Lys Glu Ser Pro Glu Lys Ile Ile Me - #t Ala Leu Phe Glu Ala     #  26205     - Val Gln Thr Val Gly Asn Pro Leu Arg Leu Va - #l Tyr His Leu Tyr Gly     #               26402630 - #                2635     - Val Phe Tyr Lys Gly Trp Glu Ala Lys Glu Le - #u Ala Gln Arg Thr Ala     #              26550     - Gly Arg Asn Leu Phe Thr Leu Ile Met Phe Gl - #u Ala Val Glu Leu Leu     #          26705     - Gly Val Asp Ser Glu Gly Lys Ile Arg Gln Le - #u Ser Ser Asn Tyr Ile     #      26850     - Leu Glu Leu Leu Tyr Lys Phe Arg Asp Asn Il - #e Lys Ser Ser Val Arg     #  27005     - Glu Ile Ala Ile Ser Trp Ala Pro Ala Pro Ph - #e Ser Cys Asp Trp Thr     #               27202710 - #                2715     - Pro Thr Asp Asp Arg Ile Gly Leu Pro His As - #p Asn Tyr Leu Arg Val     #              27350     - Glu Thr Lys Cys Pro Cys Gly Tyr Arg Met Ly - #s Ala Val Lys Asn Cys     #          27505     - Ala Gly Glu Leu Arg Leu Leu Glu Glu Gly Gl - #y Ser Phe Leu Cys Arg     #      27650     - Asn Lys Phe Gly Arg Gly Ser Gln Asn Tyr Ar - #g Val Thr Lys Tyr Tyr     #  27805     - Asp Asp Asn Leu Ser Glu Ile Lys Pro Val Il - #e Arg Met Glu Gly His     #               28002790 - #                2795     - Val Glu Leu Tyr Tyr Lys Gly Ala Thr Ile Ly - #s Leu Asp Phe Asn Asn     #              28150     - Ser Lys Thr Val Leu Ala Thr Asp Lys Trp Gl - #u Val Asp His Ser Thr     #          28305     - Leu Val Arg Ala Leu Lys Arg Tyr Thr Gly Al - #a Gly Tyr Arg Gly Ala     #      28450     - Tyr Leu Gly Glu Lys Pro Asn His Lys His Le - #u Ile Gln Arg Asp Cys     #  28605     - Ala Thr Ile Thr Lys Asp Lys Val Cys Phe Il - #e Lys Met Lys Arg Gly     #               28802870 - #                2875     - Cys Ala Phe Thr Tyr Asp Leu Ser Leu His As - #n Leu Thr Arg Leu Ile     #              28950     - Glu Leu Val His Lys Asn Asn Leu Glu Asp Ar - #g Glu Ile Pro Ala Val     #          29105     - Thr Val Thr Thr Trp Leu Ala Tyr Thr Phe Va - #l Asn Glu Asp Ile Gly     #      29250     - Thr Ile Lys Pro Thr Phe Gly Glu Lys Val Th - #r Pro Glu Lys Gln Glu     #  29405     - Glu Val Val Leu Gln Pro Ala Val Val Val As - #p Thr Thr Asp Val Ala     #               29602950 - #                2955     - Val Thr Val Val Gly Glu Thr Ser Thr Met Th - #r Thr Gly Glu Thr Pro     #              29750     - Thr Thr Phe Thr Ser Leu Gly Ser Asp Ser Ly - #s Val Arg Gln Val Leu     #          29905     - Lys Leu Gly Val Asp Asp Gly Gln Tyr Pro Gl - #y Pro Asn Gln Gln Arg     #      30050     - Ala Ser Leu Leu Glu Ala Ile Gln Gly Val As - #p Glu Arg Pro Ser Val     #  30205     - Leu Ile Leu Gly Ser Asp Lys Ala Thr Ser As - #n Arg Val Lys Thr Ala     #               30403030 - #                3035     - Lys Asn Val Lys Ile Tyr Arg Ser Arg Asp Pr - #o Leu Glu Leu Arg Glu     #              30550     - Met Met Lys Arg Gly Lys Ile Leu Val Val Al - #a Leu Ser Arg Val Asp     #          30705     - Thr Ala Leu Leu Lys Phe Val Asp Tyr Lys Gl - #y Thr Phe Leu Thr Arg     #      30850     - Glu Thr Leu Glu Ala Leu Ser Leu Gly Lys Pr - #o Lys Lys Arg Asp Ile     #  31005     - Thr Lys Ala Glu Ala Gln Trp Leu Leu Arg Le - #u Glu Asp Gln Ile Glu     #               31203110 - #                3115     - Glu Leu Pro Asp Trp Phe Ala Ala Lys Glu Pr - #o Ile Phe Leu Glu Ala     #              31350     - Asn Ile Lys Arg Asp Lys Tyr His Leu Val Gl - #y Asp Ile Ala Thr Ile     #          31505     - Lys Glu Lys Ala Lys Gln Leu Gly Ala Thr As - #p Ser Thr Lys Ile Ser     #      31650     - Lys Glu Val Gly Ala Lys Val Tyr Ser Met Ly - #s Leu Ser Asn Trp Val     #  31805     - Ile Gln Glu Glu Asn Lys Gln Gly Ser Leu Al - #a Pro Leu Phe Glu Glu     #               32003190 - #                3195     - Leu Leu Gln Gln Cys Pro Pro Gly Gly Gln As - #n Lys Thr Thr His Met     #              32150     - Val Ser Ala Tyr Gln Leu Ala Gln Gly Asn Tr - #p Val Pro Val Ser Cys     #          32305     - His Val Phe Met Gly Thr Ile Pro Ala Arg Ar - #g Thr Lys Thr His Pro     #      32450     - Tyr Glu Ala Tyr Val Lys Leu Arg Glu Leu Va - #l Asp Glu His Lys Met     #  32605     - Lys Ala Leu Cys Gly Gly Ser Gly Leu Ser Ly - #s His Asn Glu Trp Val     #               32803270 - #                3275     - Ile Gly Lys Val Lys Tyr Gln Gly Asn Leu Ar - #g Thr Lys His Met Leu     #              32950     - Asn Pro Gly Lys Val Ala Glu Gln Leu His Ar - #g Glu Gly Tyr Arg His     #          33105     - Asn Val Tyr Asn Lys Thr Ile Gly Ser Val Me - #t Thr Ala Thr Gly Ile     #      33250     - Arg Leu Glu Lys Leu Pro Val Val Arg Ala Gl - #n Thr Asp Thr Thr Asn     #  33405     - Phe His Gln Ala Ile Arg Asp Lys Ile Asp Ly - #s Glu Glu Asn Leu Gln     #               33603350 - #                3355     - Thr Pro Gly Leu His Lys Lys Leu Met Glu Va - #l Phe Asn Ala Leu Lys     #              33750     - Arg Pro Glu Leu Glu Ala Ser Tyr Asp Ala Va - #l Asp Trp Glu Glu Leu     #          33905     - Glu Arg Gly Ile Asn Arg Lys Gly Ala Ala Gl - #y Phe Phe Glu Arg Lys     #      34050     - Asn Ile Gly Glu Val Leu Asp Ser Glu Lys As - #n Lys Val Glu Glu Val     #  34205     - Ile Asp Ser Leu Lys Lys Gly Arg Asn Ile Ar - #g Tyr Tyr Glu Thr Ala     #               34403430 - #                3435     - Ile Pro Lys Asn Glu Lys Arg Asp Val Asn As - #p Asp Trp Thr Ala Gly     #              34550     - Asp Phe Val Asp Glu Lys Lys Pro Arg Val Il - #e Gln Tyr Pro Glu Ala     #          34705     - Lys Thr Arg Leu Ala Ile Thr Lys Val Met Ty - #r Lys Trp Val Lys Gln     #      34850     - Lys Pro Val Val Ile Pro Gly Tyr Glu Gly Ly - #s Thr Pro Leu Phe Gln     #  35005     - Ile Phe Asp Lys Val Lys Lys Glu Trp Asp Gl - #n Phe Gln Asn Pro Val     #               35203510 - #                3515     - Ala Val Ser Phe Asp Thr Lys Ala Trp Asp Th - #r Gln Val Thr Thr Arg     #              35350     - Asp Leu Glu Leu Ile Arg Asp Ile Gln Lys Ph - #e Tyr Phe Lys Lys Lys     #          35505     - Trp His Lys Phe Ile Asp Thr Leu Thr Lys Hi - #s Met Ser Glu Val Pro     #      35650     - Val Ile Ser Ala Asp Gly Glu Val Tyr Ile Ar - #g Lys Gly Gln Arg Gly     #  35805     - Ser Gly Gln Pro Asp Thr Ser Ala Gly Asn Se - #r Met Leu Asn Val Leu     #               36003590 - #                3595     - Thr Met Val Tyr Ala Phe Cys Glu Ala Thr Gl - #y Val Pro Tyr Lys Ser     #              36150     - Phe Asp Arg Val Ala Lys Ile His Val Cys Gl - #y Asp Asp Gly Phe Leu     #          36305     - Ile Thr Glu Arg Ala Leu Gly Glu Lys Phe Al - #a Ser Lys Gly Val Gln     #      36450     - Ile Leu Tyr Glu Ala Gly Lys Pro Gln Lys Il - #e Thr Glu Gly Asp Lys     #  36605     - Met Lys Val Ala Tyr Gln Phe Asp Asp Ile Gl - #u Phe Cys Ser His Thr     #               36803670 - #                3675     - Pro Val Gln Val Arg Trp Ser Asp Asn Thr Se - #r Ser Tyr Met Pro Gly     #              36950     - Arg Asn Thr Thr Thr Ile Leu Ala Lys Met Al - #a Thr Arg Leu Asp Ser     #          37105     - Ser Gly Glu Arg Gly Thr Ile Ala Tyr Glu Ly - #s Ala Val Ala Phe Ser     #      37250     - Phe Leu Leu Met Tyr Ser Trp Asn Pro Leu Il - #e Arg Arg Ile Cys Leu     #  37405     - Leu Val Leu Ser Thr Glu Leu Gln Val Arg Pr - #o Gly Lys Ser Thr Thr     #               37603750 - #                3755     - Tyr Tyr Tyr Glu Gly Asp Pro Ile Ser Ala Ty - #r Lys Glu Val Ile Gly     #              37750     - His Asn Leu Phe Asp Leu Lys Arg Thr Ser Ph - #e Glu Lys Leu Ala Lys     #          37905     - Leu Asn Leu Ser Met Ser Thr Leu Gly Val Tr - #p Thr Arg His Thr Ser     #      38050     - Lys Arg Leu Leu Gln Asp Cys Val Asn Val Gl - #y Thr Lys Glu Gly Asn     #  38205     - Trp Leu Val Asn Ala Asp Arg Leu Val Ser Se - #r Lys Thr Gly Asn Arg     #               38403830 - #                3835     - Tyr Ile Pro Gly Glu Gly His Thr Leu Gln Gl - #y Lys His Tyr Glu Glu     #              38550     - Leu Ile Leu Ala Arg Lys Pro Ile Gly Asn Ph - #e Glu Gly Thr Asp Arg     #          38705     - Tyr Asn Leu Gly Pro Ile Val Asn Val Val Le - #u Arg Arg Leu Lys Ile     #      38850     - Met Met Met Ala Leu Ile Gly Arg Gly Val     #   3895     - (2) INFORMATION FOR SEQ ID NO:3:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 33 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY:               (B) LOCATION: 1..33     #/label= primer.sub.-- 1RMATION:     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:     #         33       AGTG CTGTGACTTT AAA     - (2) INFORMATION FOR SEQ ID NO:4:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 39 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY:               (B) LOCATION: 1..39     #/label= primer.sub.-- 2RMATION:     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:     #    39            GTGG GGCTCACTGC TGTGCACTC     - (2) INFORMATION FOR SEQ ID NO:5:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 16 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY:               (B) LOCATION: 1..16     #/label= Adaptor.sub.-- 1MATION:      Hinf I adaptor,f Bam HI     #ATG at 364-366"ontaining     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:     #    16     - (2) INFORMATION FOR SEQ ID NO:6:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 16 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY:               (B) LOCATION: 1..16     #/label= Adaptor.sub.-- 2MATION:      Hinf I adaptor,f Bam HI     #ATG at 364-366"ontaining     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:     #    16     - (2) INFORMATION FOR SEQ ID NO:7:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 10 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY:               (B) LOCATION: 1..10     #/label= Adaptor.sub.-- 3MATION:      Eco RI bluntnded Stu I                    adaptor"     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:     #        10     - (2) INFORMATION FOR SEQ ID NO:8:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 21 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY:               (B) LOCATION: 1..21     #/label= Adaptor.sub.-- 4MATION:      BamH I adaptor"f Bgl II     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:     #21                CCTG T     - (2) INFORMATION FOR SEQ ID NO:9:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 14 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY:               (B) LOCATION: 1..14     #/label= Adaptor.sub.-- 5MATION:      BamH I adaptor"f Bgl II     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:     #     14     - (2) INFORMATION FOR SEQ ID NO:10:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 15 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY:               (B) LOCATION: 1..15     #/label= Adaptor.sub.-- 6MATION:      Eco R I adaptor" Ban I     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:     #    15     - (2) INFORMATION FOR SEQ ID NO:11:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 15 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY:               (B) LOCATION: 1..15     #/label= Adaptor.sub.-- 7MATION:      Eco R I adaptor" Ban I     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:     #    15     - (2) INFORMATION FOR SEQ ID NO:12:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 300 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -    (vii) IMMEDIATE SOURCE:     #clone    (B) CLONE: lambda gt11     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..300     #/note= "Part of 0.8 kb insert of                    Lambda gt - #11"     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:     - AGT GAC AAC GGC ACT AAT GGT ATT CAG CGA GC - #C ATG TAT CTT AGA GGG       48     Ser Asp Asn Gly Thr Asn Gly Ile Gln Arg Al - #a Met Tyr Leu Arg Gly     #                 15     - GTT AAC AGG AGC TTA CAT GGG ATC TGG CCC GA - #G AAA ATA TGC AAG GGG       96     Val Asn Arg Ser Leu His Gly Ile Trp Pro Gl - #u Lys Ile Cys Lys Gly     #             30     - GTC CCC ACT CAT CTG GCC ACT GAC ACG GAA CT - #G AAA GAG ATA CGC GGG      144     Val Pro Thr His Leu Ala Thr Asp Thr Glu Le - #u Lys Glu Ile Arg Gly     #         45     - ATG ATG GAT GCC AGC GAG AGG ACA AAC TAT AC - #G TGC TGT AGG TTA CAA      192     Met Met Asp Ala Ser Glu Arg Thr Asn Tyr Th - #r Cys Cys Arg Leu Gln     #     60     - AGA CAT GAA TGG AAC AAA CAT GGA TGG TGT AA - #C TGG TAC AAC ATA GAC      240     Arg His Glu Trp Asn Lys His Gly Trp Cys As - #n Trp Tyr Asn Ile Asp     # 80     - CCT TGG ATT CAG TTA ATG AAC AGG ACC CAA AC - #A AAT TTG ACA GAA GGC      288     Pro Trp Ile Gln Leu Met Asn Arg Thr Gln Th - #r Asn Leu Thr Glu Gly     #                 95     #      300     Pro Pro Asp Lys                 100     - (2) INFORMATION FOR SEQ ID NO:13:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 100 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:     - Ser Asp Asn Gly Thr Asn Gly Ile Gln Arg Al - #a Met Tyr Leu Arg Gly     #                 15     - Val Asn Arg Ser Leu His Gly Ile Trp Pro Gl - #u Lys Ile Cys Lys Gly     #             30     - Val Pro Thr His Leu Ala Thr Asp Thr Glu Le - #u Lys Glu Ile Arg Gly     #         45     - Met Met Asp Ala Ser Glu Arg Thr Asn Tyr Th - #r Cys Cys Arg Leu Gln     #     60     - Arg His Glu Trp Asn Lys His Gly Trp Cys As - #n Trp Tyr Asn Ile Asp     # 80     - Pro Trp Ile Gln Leu Met Asn Arg Thr Gln Th - #r Asn Leu Thr Glu Gly     #                 95     - Pro Pro Asp Lys                 100     __________________________________________________________________________ 

What is claimed is:
 1. An isolated DNA sequence which encodes the 55 kD protein of hog cholera virus (HCV) or an antigenic fragment thereof comprising the amino acid sequence from about amino acid 812 to about amino acid 859 of SEQ ID NO:2.
 2. The DNA according to claim 1, which encodes a polypeptide comprising the amino acid sequence from about 689 to about 1067 of SEQ ID NO:2 or an antigenic fragment thereof comprising the amino acid sequence from about 812 to about 859 of SEQ ID NO:2.
 3. The DNA according to claim 1, which comprises the DNA sequence from about 2428 to about 3584 of SEQ ID NO:1, or a subsequence thereof comprising the DNA sequence from about 2799 to about 2938 of SEQ ID NO:1.
 4. A recombinant nucleic acid molecule comprising a vector nucleic acid molecule and a DNA sequence according to claim
 1. 5. The recombinant nucleic acid molecule according to claim 4, wherein the DNA sequence is operably linked to expression control sequences.
 6. A host cell comprising the recombinant nucleic acid molecule according to claim
 4. 7. A host cell according to claim 6, wherein the host cell is a bacterium.
 8. A recombinant virus, comprising the nucleic acid molecule of claim
 4. 9. The recombinant virus of claim 8, wherein the virus is pseudorabies virus or vaccinia.
 10. A vaccine for the protection of animals against hog cholera virus infection, comprising a host cell according to claim
 6. 11. A vaccine for the protection of animals against hog cholera virus infection, comprising a recombinant virus according to claim
 8. 12. A method for the preparation of a hog cholera virus vaccine, comprising growing a host cell according to claim 6 in culture, harvesting the cells and mixing the cells with a pharmaceutically acceptable carrier.
 13. An isolated hog cholera virus polypeptide, comprising amino acids 689 to 1067 of SEQ ID NO:2, or an antigenic fragment thereof comprising amino acids 812 to 859 of SEQ ID NO:2.
 14. An isolated hog cholera virus polypeptide, which is expressed by the host cell of claim
 6. 15. A vaccine for the protection of animals against hog cholera virus infection, comprising a polypeptide according to claim
 13. 16. A method for the preparation of a hog cholera virus vaccine, comprising mixing an immunogenically effective amount of a polypeptide according to claim 13 with a pharmaceutically acceptable carrier.
 17. A method for the preparation of a 55 kD protein of HCV or an antigenic fragment thereof, which comprises culturing the host cells of claim 6, expressing the 55 kD protein or fragment, and recovering the protein or fragment from the culture.
 18. A method for the preparation of a 55 kD protein of HCV or an antigenic fragment thereof, which comprises culturing cells infected with the recombinant virus according to claim 8, expressing the 55 kD protein or fragment, and recovering the protein or fragment from the culture. 